CYTOPLASMIC MODULATORS
OF INTEGRIN BINDING

This application is a continuation-in-part of co-pending U.S. Patent
Application Serial No. 08/583,562 which was filed on January 5, 1996.



Field of the Invention

The present invention relates generally to filamin-like integrin
binding proteins and more particularly to the cloning and expression of a
novel filamin-like protein, FLP-1.



Background 

A significant characteristic of the immune and inflammatory
responses is the movement of leukocytes from the bloodstream into specific
tissues in response to various physiological signals. For example, certain
subsets of lymphocytes "home" to various secondary lymphoid tissues such as
lymph nodes or Peyer's patches, and eventually return to circulation. Other
leukocytes such as granulocytes and monocytes, however, do not return to
circulation after transmigration from the bloodstream. Movement of
leukocytes from circulation is effected by a series of receptor/counter-receptor
interactions which are coordinated by various specific membrane adhesion
molecules.

Extravasation of leukocytes from the bloodstream [for review,
see McEver, Curr. Opin. Cell Biol. 4:840-849 (1992)] is initially effected by
a family of membrane glycoproteins termed selectins which are either
expressed constitutively or induced in response to specific cytokines. Binding
of selectins to their counterpart ligand brings leukocytes into close, but not
static, contact with vascular endothelial cells. The "tethered" leukocyte then
begins a "rolling" process along the endothelium which continues until
additional molecular interactions firmly stabilize a specific cell/cell interaction.
One of the molecular binding activities which results in the stable interaction
is effected by a second family of surface glycoproteins called integrins which
possess a higher binding affinity for their respective ligands than selectins.

The integrins are heterodimeric surface molecules comprised of
an α and a β subunit in non-covalent association. All integrins are
transmembrane proteins with counter-receptor binding activity localized in the
extracellular domain. Integrins also possess relatively short cytoplasmic
regions which participate in transmembrane signaling events. Integrins are
capable of interacting with other cell-bound counter-receptors and components
of the extracellular matrix, as well as soluble factors. Binding of extracellular
ligands leads to crosslinking and localized clustering of integrins [Miyamoto,
et al., Science 267:833, 1995] and formation of focal adhesions wherein the
clustered integrin cytoplasmic domains associate with cytoskeletal components
including, for example, actin filaments [Pavalko and Otey, Proc. Soc. Exp.
Biol. Med. 205:32767, 1994, and Gumbiner, Neuron 11:551, 1993]. While
most investigations into integrin physiological activity have focused on
identifying specific counter-receptors using immunological methodologies as
discussed infra, less is known about the specific interactions of integrins with
cytoplasmic components. Mutation studies, however, have indicated that the
cytoplasmic sequences are required for integrin association with focal contacts
and integrin dependent cell adhesion [LaFlamme, et al., J. Cell. Biol. 117:437
(1992)]. Other data discussed infra support this observation.

While numerous integrins have been identified, certain subsets
are unique to leukocytes, with each member of the subset having characteristic
cell-specific expression and counter-receptor binding properties. Of leukocyte-
specific integrins, at least three β2 integrins are known, each comprised of a
unique α subunit in association with a β2 subunit (designated CD18)
[Kishimoto, et al., Cell 48:681-690 (1987)]. For a recent review of the state
of the art with regard to β2 integrins, see Springer, Cell 76:301-314 (1994).
CD11a/CD18, also known as αLβ2 or LFA-1, is expressed on all leukocytes
and has been shown to bind to ICAM-1, ICAM-2, and ICAM-3.
CD11b/CD18, also known as αMβ2 or Mac-1, is expressed on
polymorphonuclear neutrophils, monocytes and eosinophils and has been
shown to bind to ICAM-1, complement factor iC3b, factor X, and fibrinogen.
CD11c/CD18, also known as αXβ2 or p150,95, is expressed on monocytes,
polymorphonuclear neutrophils and eosinophils and has been shown to bind
to complement factor iC3b and fibrinogen. In addition, a fourth human β2
integrin, designated αdβ2, has recently been identified [Van der Vieren, et al.,
Immunity 3:683-690 (1995)]. Recently, it has been demonstrated that the
actin-binding protein, filamin, directly binds to a cytoplasmic portion of β2
subunits [Sharma, et al., J. Immunol. 154:3461-3470 (1995)] which suggests
a role for one or more of the β2 integrins in formation of focal contacts and
cell motility in general [see review in Arnaout, Blood 75:1037 (1990)].

A second subset of leukocyte specific integrins may be referred
to as the α4 integrins in view of the fact that both members of the family are
comprised of a common α4 subunit in association with either a β1 or β7
subunit. For a recent review, see Springer, supra. VLA-4, also referred to
as α4β1 or CD49d/CD29, is expressed on most peripheral blood leukocytes
except neutrophils and specifically binds VCAM-1 and fibronectin. LPAM-1,
also known as α4β7, is expressed on all peripheral blood leukocytes and has
been shown to bind MadCAM-1, fibronectin and VCAM-1. Expression of
either of the α4 integrins has also been demonstrated in a wide range of
leukocyte cell types in lymphoid organs and in various tissues (Hemler et al.,
Immunol. Rev. 114:45-60, 1990; Kilshaw et al., Eur. J. Immunol. 20:2201-
2207, 1990; Schweighoffer et al., J. Immunol. 151:717-729, 1993; and
Lazarovits and Karsh, J. Immunol. 151:6482-6489, 1993). Consistent with
the observed participation of β2 integrins in formation of focal contacts,
presumably through filamin binding, it has previously been shown that
cytoplasmic portions of β1 integrins directly bind α-actinin in vitro. While
this interaction has not been demonstrated in vivo, it suggests physiological
involvement of β1 integrins in cell mobility and/or maintenance of cell
morphology [see review in Clark and Brugge, Science 268:233-238 (1995)].

A number of in vitro and in vivo studies utilizing anti-α4
monoclonal antibodies have indicated a role for the α4 integrins in various
pathophysiological conditions [see review, Lobb and Hemler, J. Clin. Invest.
94:1722-1728 (1994)]. For example, several investigations have provided
evidence that α4 integrins are involved in leukocyte emigration from peripheral
blood into regions of inflammation (Weg, et al., J. Exp. Med. 177:561-566,
1992; Winn and Harlan, J. Clin. Invest. 92:1168-1173, 1993). These
observations suggest that anti-α4 antibodies may be capable of ameliorating
integrin-associated disease states, and this therapeutic potential has been
demonstrated in several animal disease state models. For example, bolus
injection of antibodies to α4 integrins delayed the onset of paralysis in rat and
murine experimental allergic encephalomyelitis (Yednock, et al., Nature
356:63-66, 1992; Baron, et al., J. Exp. Med. 177:57-68, 1993). Prophylactic
administration of anti-α4 antibodies reduced ear swelling in murine contact
hypersensitivity models (Ferguson, et al., J. Immunol. 150:1172-1182, 1993;
Nakajima, et al., J. Exp. Med. 179:1145-1154, 1994). Further, anti-α4
antibodies were shown to reduce infiltration of pancreatic islets and delay the
onset of diabetes in non-obese diabetic mice which are prone to spontaneous
development of type I diabetes (Yang, et al., Proc. Natl. Acad. Sci. (USA)
90:10494-10498, 1993; Burkly, et al., Diabetes 43:529-534, 1994; Baron, et
al., J. Clin. Invest. 93:1700-1708, 1994). Still other in vivo studies using
anti-α4 antibodies suggest a role for α4 integrins in allergic lung inflammation
(Pretolani, et al., J. Exp. Med. 180:795-805 (1994); Milne and Piper, Br. J.
Pharmacol. 112:82Pa (Abstr), 1994); inflammatory bowel disease (Podolsky,
et al., J. Clin. Invest. 92:372-380, 1993); cardiac allograft rejection (Paul, et
al., Transplantation 55:1196-1199, 1993); acute nephrotoxic nephritis
(Mulligan, et al., J. Clin. Invest. 91:577-587, 1993); and immune complex
mediated lung injury (Mulligan, et al., J. Immunol. 159:2407-2417, 1993).

Thus there exists a need in the art to identify molecules which
bind to and/or modulate the binding and/or signalling activities of the integrins
and to develop methods by which these molecules can be identified. The
methods, and the molecules thereby identified, will provide practical means
for therapeutic intervention in α4 integrin-mediated immune and inflammatory
responses.

Brief Description of the Invention

In one aspect, the present invention provides novel purified and
isolated polynucleotides (e.g., DNA and RNA transcripts, both sense and
antisense strands) encoding a filamin-like β7 integrin binding protein designated
FLP-1, or variants thereof (i.e., deletion, addition or substitution analogs)
which possess binding and/or immunological properties inherent to FLP-1.
Preferred DNA molecules of the invention include cDNA, genomic DNA and
wholly or partially chemically synthesized DNA molecules. Presently
preferred polynucleotides include the DNA as set forth in SEQ ID NO: 1,
encoding the polypeptide according to SEQ ID NO: 2. Alternatively, a
preferred polynucleotide encodes a polypeptide according to SEQ ID NO: 2
except that the amino acid at position 146 is a proline rather than a leucine,
the amino acid at position 442 is a proline rather than an alanine and the
amino acid at position 548 is a valine rather than a methionine. Such a
polynucleotide would hybridize to the DNA set out in SEQ ID NO: 1.

Preferred polynucleotides of the invention comprise the cDNA
set out in SEQ ID NO: 1 and DNAs which hybridize to the non-coding
strands thereof under stringent conditions or which would hybridize but for the
redundancy of the genetic code. Exemplary stringent hybridization conditions
are as follows: hybridization at 42°C in 5X SSPE and a final wash at 65°C
in 0.2X SSC. It is understood by those of skill in the art that variation in
these conditions occurs based on the length and GC nucleotide content of the
sequences to be hybridized. Formulas standard in the art are appropriate for
determining exact hybridization conditions. See Sambrook, et al., Eds.,
9.47-9.51 in Molecular Cloning, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, New York (1989).

Also provided are recombinant plasmid and viral expression
constructs which include FLP-1 encoding sequences, wherein the FLP-1
encoding sequence is operatively linked to a homologous or heterologous
transcriptional regulatory element or elements.

As another aspect of the invention, prokaryotic or eukaryotic
host cells, transformed or transfected with polynucleotide sequences of the
invention, are provided which express FLP-1 polypeptides or variants thereof.
Host cells of the invention are particularly useful for large scale production
of FLP-1 polypeptides which can be isolated from the host cell itself or the
medium in which the host cell is grown.

Also provided by the present invention are purified and isolated
FLP-1 polypeptides, including fragments and variants thereof. Novel FLP-1
polypeptides of the invention may be isolated from natural sources, but along
with FLP-1 variant polypeptides, are preferably produced by recombinant
procedures involving host cells of the invention. Variant FLP-1 polypeptides,
including fully glycosylated, partially glycosylated, and wholly de-glycosylated
forms of the FLP-1 polypeptide may be generated, depending on the host cell
selected for recombinant production and/or post-isolation processing.
Additional variant FLP-1 polypeptides include water soluble and insoluble
FLP-1 polypeptides and fragments thereof, analogs wherein one or more
amino acids are deleted from, replaced in, or added to the preferred FLP-1
polypeptide, polypeptide analogs with equal or enhanced biological activities
and/or immunological characteristics specific for FLP-1, and analogs with
modified ligand binding and/or signal transducing capabilities. Fusion
polypeptides are also provided wherein FLP-1 amino acid sequences are
expressed contiguously with amino acid sequences derived from other
polypeptides. Fusion polypeptides of the invention include those with
modified biological, biochemical, and/or immunological properties in
comparison to the preferred FLP-1 polypeptide.

Also contemplated by the present invention are antibodies and
other peptide and non-peptide molecules which specifically bind to FLP-1.
Binding molecules of this type are particularly useful for purifying FLP-1
polypeptides, identifying cell types which express FLP-1 polypeptides, and
assaying for presence or absence of FLP-1 polypeptides in a fluid. Binding
molecules are also useful for modulating (i.e., blocking, inhibiting, or
stimulating) in vivo binding and/or signal transduction activities of FLP-1.
Antibodies of the invention include monoclonal, polyclonal, and recombinant
(i.e., humanized, chimeric, etc.) forms and fragments thereof.

Also contemplated by the invention are hybridomas which
secrete monoclonal antibodies specifically immunoreactive with FLP-1.
Likewise, cell types modified by recombinant means so as to express and/or
secrete genetically engineered FLP-1 binding molecules are also
comprehended.

Assays to identify FLP-1 binding molecules are also provided,
including immobilized ligand binding assays, solution binding assays,
scintillation proximity assays, two hybrid screening assays, immunological
methodologies and the like. In addition to identifying FLP-1 binding
molecules, the same or similar assays are useful for identification of molecules
which modulate FLP-1 specific binding. For example, assays to identify
modulators (i.e., activators or inhibitors) of FLP-1 specific binding can
involve a) contacting FLP-1 or a fragment thereof, with β7 integrin or a
fragment thereof; b) measuring binding between FLP-1 or a fragment
thereof, and β7 integrin or a fragment thereof; c) measuring binding between
FLP-1 or a fragment thereof, and β7 integrin or a fragment thereof in the
presence of a test compound; and d) comparing the measurement in step (b)
and the measurement in step (c) wherein a decrease in binding in step (c)
indicates the test compound is an inhibitor of binding, and an increase in
binding in step (c) indicates the test compound is an activator of binding.
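
The comparison in steps (b) through (d) can be sketched in a few lines of
Python. This is an illustration only: the function name, the tolerance
threshold and the example readings are hypothetical and are not taken from
this specification, which does not state how large a change in binding must
be before a compound is called a modulator.

```python
def classify_modulator(binding_without, binding_with, tolerance=0.10):
    """Compare binding measured without (step b) and with (step c) a test compound.

    `tolerance` is an assumed fractional change below which the two
    measurements are treated as indistinguishable; it is not specified
    in the text.
    """
    if binding_without <= 0:
        raise ValueError("baseline binding must be positive")
    # Fractional change in binding caused by the test compound (step d).
    change = (binding_with - binding_without) / binding_without
    if change < -tolerance:
        return "inhibitor"   # decreased binding in step (c)
    if change > tolerance:
        return "activator"   # increased binding in step (c)
    return "no effect"
```

Any quantitative read-out (for example counts from a scintillation
proximity assay) could be passed in, provided both measurements are on the
same scale.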

Variations on the method to identify modulators of FLP-1
binding can include scintillation proximity assays comprising the steps of
immobilizing either FLP-1 or its binding partner on a solid support, wherein
the solid support contains a fluorescent agent; modifying the non-immobilized
binding partner to include a compound that can excite the immobilized
fluorescent agent; contacting the non-immobilized binding partner with the
immobilized binding partner; determining the level of light emission for the
fluorescent agent; and repeating the procedure in the presence of a putative
modulator of FLP-1 binding.

As still another variation of the method, a two hybrid system
may be utilized to identify genes encoding potential modulators. In this
system, an integrin sequence is expressed in a host cell as a fusion protein
with either a DNA binding domain or transactivation domain of a modular
transcription factor. A binding partner protein is also expressed as a fusion
protein with the transcription factor domain not utilized in expressing the
integrin fusion protein. Interaction of the two fusion proteins results in
reconstitution of the holo-transcription factor and permits expression of a
reporter gene with a promoter specific for the transcription factor. Use of this
system in the presence or absence of library cDNA can permit identification
of genes that encode proteins which modulate the degree of reporter gene
expression.

Additional methods comprehended by the invention include
immunological assays including radio-immuno assays, enzyme linked
immunosorbent assays, sandwich assays and the like. Co-precipitation
methods are also comprehended wherein an antibody immunospecific for one
binding partner is utilized in a method in which the other binding partner is
detectably labeled. Immunological assays may also include use of labeled
antibodies specifically immunoreactive with a complex between the desired
binding partners.

Numerous compounds are contemplated as being candidates for
testing in methods of the invention. For example, antibody products which
are immunoreactive with one binding partner and which modulate binding
between the two molecules can be identified by the claimed method.
Antibody products contemplated are monoclonal antibodies, and fragments
thereof, humanized antibodies, and/or single chain antibodies. Other
molecules which can be screened in the claimed method include peptides,
small molecules and libraries composed of either of the same.

Modulators of β7/FLP-1 and β7/filamin interaction identified by
the methods of the invention are utilized in vitro or in vivo to affect
inflammatory processes involving leukocytes. In addition, modulating
compounds which bind to either the β7 integrin, filamin or FLP-1 are useful
to monitor the level of its binding partner, either in a body fluid or biopsied
tissue.

Those of ordinary skill in the art will readily appreciate that
numerous variations of the claimed method are encompassed by the invention.

Detailed Description of the Invention 

The present invention is illustrated by the following examples
relating to the isolation of a cDNA clone encoding FLP-1. Example 1 relates
to identification of genes in a human B cell cDNA library that encode proteins
which interact with β7 integrin. Example 2 describes identification of genes
in a human spleen cDNA library which encode proteins that interact with β7
integrin. Example 3 addresses tissue specific expression of FLP-1. Example
4 describes specificity of interaction between filamin and β7 and FLP-1 and
β7 integrin. Example 5 describes localization of β7 sequences required for
filamin or FLP-1 binding. Example 6 relates to applications for modulators
of β7/filamin or β7/FLP-1 interactions.



Example 1 

Identification of Genes in a B Cell Library
Encoding β7 Interacting Proteins

The two-hybrid system developed in yeast [Durfee, et al.,
Genes and Development 7:555-567 (1993)] was used to screen for proteins
expressed in a human B cell cDNA library which interact with the carboxy-
terminal cytoplasmic tail of the β7 integrin. The yeast two-hybrid screen is
based on in vivo reconstitution of the GAL4 transcription factor and
subsequent expression of a reporter gene driven by a GAL4 promoter.
Briefly, GAL4 DNA-binding and transcription-activating domains are encoded
on separate plasmids as portions of fusion proteins. Expression of the fusion
proteins and interaction of the expression products results in association of the
two GAL4 domains and ultimate expression of the β-galactosidase reporter gene
under transcriptional control of the GAL4 promoter.

In the present investigation, a "bait" plasmid (pAS1) was
constructed that contained sequences encoding the GAL4-binding domain, a
trp selection requirement, a hemagglutinin (HA) epitope tag and cytoplasmic
amino acid sequences of β7 integrin. The β7 integrin cytoplasmic domain was
amplified by PCR using β7 primers set out in SEQ ID NO: 3 and 4.

NHβ7 5    CGGATCCTCGGATACCGGCTCTCGGTGAAG    (SEQ ID NO: 3)
NHβ7 3    CGGCTCCTCAGAGAGTGGGACTGTCTGCCT    (SEQ ID NO: 4)

Reaction conditions included an initial incubation at 94°C for four minutes,
followed by thirty cycles of: 94°C for one minute, 50°C for two minutes, and
72°C for four minutes. The resulting product was sequenced to rule out PCR-
derived errors and subcloned into vector pAS1. A yeast strain, Y190, was
transformed with β7/pAS1 by standard methods and grown in selective media
(trp-) to mid-log phase. Cells were lysed in lysis buffer (containing 100 mM
Tris, pH 6.8, 2% SDS, 10% glycerol, 5% BME and 0.1% bromophenol
blue) and the equivalent of 5-6 x 10^6 cells of protein was separated on a 12%
polyacrylamide gel. Proteins in the gel were transferred to a PVDF
(Millipore, Bedford, MA) membrane by standard methods. Control lanes on
the gel contained lysate from Y190 cells transformed with pAS1 vector alone
(containing no β7 integrin-encoding sequences). Western blotting was
performed using antibody 12CA5, immunospecific for the HA epitope tag
(Boehringer Mannheim, Indianapolis, IN), and a goat anti-mouse IgG
horseradish peroxidase (HRP) secondary antibody. Results, in combination with
size determination using SDS-PAGE, confirmed that the fusion protein β7
integrin cytoplasmic tail/HA/GAL4 DNA-binding domain was expressed at
readily detectable levels.
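
The PCR cycling conditions given earlier in this example can be written
down as a small data structure, which makes the programmed hold time easy
to total. The sketch below is illustrative: the class and variable names
are hypothetical, and the total ignores the ramp time between temperatures,
which depends on the thermal cycler and is not stated in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temperature_c: int  # hold temperature in degrees Celsius
    minutes: int        # hold time in minutes


# Values taken from the reaction conditions stated in this example.
INITIAL_DENATURATION = Step(94, 4)
CYCLE = (Step(94, 1), Step(50, 2), Step(72, 4))
N_CYCLES = 30


def total_minutes(initial, cycle, n_cycles):
    """Total programmed hold time, excluding ramping between steps."""
    return initial.minutes + n_cycles * sum(step.minutes for step in cycle)


PROGRAM_MINUTES = total_minutes(INITIAL_DENATURATION, CYCLE, N_CYCLES)
```

With the stated conditions each cycle holds for seven minutes, so the
programmed hold time is 4 + 30 x 7 = 214 minutes.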

A "target" vector was constructed with vector pACT modified
to contain sequences encoding the GAL4 activation domain II fused to a B cell
cDNA library and a leu selection requirement. Lymphocyte cDNA library
sequences were inserted at an XhoI site of the vector. β7/pAS1-transformed
Y190 cells were transformed by standard methods with the pACT-lymphocyte
library DNA and cells grown under selective conditions (leu-/trp-/his-/3-
aminotriazole). Resulting colonies were tested for β-galactosidase (β-gal)
activity by the blue/white selection method well known in the art and forty-
four β-gal positive clones were obtained. Sequence analysis of the B cell
cDNA-derived pACT inserts in each of the clones revealed twenty novel
sequences and twenty-four sequences encoding known proteins or portions of
known proteins.

Five clones were of particular interest, all of which contained
sequences encoding a portion of the non-muscle protein filamin, or actin-
binding protein ABP280 (emb/X53416) [Gorlin, et al., J. Cell Biol. 111:1089-
1105 (1990)]. All five clones were shown to encode the carboxy-terminal
portions of filamin (SEQ ID NO: 7) and each clone extended into 3'
untranslated portions of the filamin gene. Clone 411 corresponded to
sequences in repeat 20 (beginning at nucleotide 6763 in SEQ ID NO: 7) and
clones 514, 1521, 1271 and 722 to sequences beginning in repeat 23 (beginning
at nucleotides 7513, 7552, 7579, and 7579 in SEQ ID NO: 7, respectively).
There was one discrepancy between the published sequence of filamin and the
sequences determined in each of the positive clones: all positive clones had an
aspartate residue at position 2634, while the published sequence of filamin had
a histidine at that position. Of these clones, 1271, 514 and 411 were selected
for subsequent analysis, and the nucleotide and amino acid sequences of 1271
are set out in SEQ ID NOs: 5 and 6, respectively.

Example 2

Identification of Genes in a Human Spleen
Library Encoding β7 Interacting Proteins

The two-hybrid system described in Example 1 was repeated
using human spleen cDNA library sequences (Clontech, Palo Alto, CA)
cloned into an EcoRI site of the target vector pGAD10 (Clontech).

After transformation of the β7/pAS1 Y190 strain with the
spleen/pGAD10 plasmid and selection as previously described, the resulting
colonies were tested for β-gal activity and six positive clones were identified.
Sequence analysis of the six β-gal positive clones revealed five identical
clones (from which clone S5 was selected for further analysis) along with
clone S3 (the sixth positive clone, distinct from the other five).

DNA and protein alignments revealed that clones S3 and S5
encode different, but overlapping regions of the same protein, with the S3
insert beginning 5' of the S5 insert, and terminating before the 3' end of
clone S5. The DNA sequences of clones S3 and S5 were compared to DNA
databases using NCBI Blastn with default parameters on October 16, 1995,
and both clones were found to exhibit approximately 70% identity to filamin.
The nucleotide and amino acid sequences of clone S3 are set out in SEQ ID
NOs: 9 and 10, respectively. Sequences for clone S5 are set out in SEQ ID
NOs: 11 and 12, respectively. The composite protein encoded by the
overlapping clones S3 and S5 was designated FLP-1 (filamin-like protein).
Blastp search of protein database (NCBI Blastp) revealed that the composite
protein FLP-1 has a 73% identity to filamin. Alignment of FLP-1 to filamin
shows that clones S3 and S5 represent carboxy terminal regions of FLP-1.
When FLP-1 is aligned with filamin in the second hinge region between
repeats 23 and 24, the putative glycoprotein binding region, the degree of
identity drops to 38%, suggesting a difference in binding affinity between
filamin and FLP-1 for membrane glycoproteins.

In addition, a region of clone S5 was further found to exhibit
100% identity to truncated actin-binding protein TABP (GP or GB/M62994),
a protein previously shown to be a truncated, non-actin-binding filamin-like
protein [Leedman, et al., Proc. Natl. Acad. Sci. (USA) 90:5994-5998 (1993)]
having 195 amino acids and a molecular weight of approximately 21 kDa.
Identity was particularly high between nucleotides 950-1515 of clone S5 which
were 95-99% identical to regions of TABP. TABP lacks an actin binding
domain and 22 of 24 tandem repeats found in filamin, but contains sequences
homologous to the carboxy terminal repeats numbered 23 and 24 found in
filamin. The TABP hinge region, between repeats 23 and 24, contains a
putative glycoprotein binding site and a Ca2+/calmodulin kinase II
phosphorylation site [Leedman, supra]. TABP is encoded by a 2.3 kb mRNA
and a cDNA encoding TABP was cloned from a thyroid expression library
from a Graves' disease patient [Leedman, supra].

In order to obtain a more complete FLP-1 sequence, the human
spleen cDNA library was screened using S3 as a probe. The S3 clone was
digested with EcoRI and a 1.2 kb fragment was isolated and labeled using the
Random Primed Labeling Kit (Boehringer Mannheim, Indianapolis, IN)
according to the manufacturer's suggested protocol. Unincorporated
nucleotides were removed using a Centrisep column (Princeton Separations,
Adelphia, NJ). The probe was added to filters in hybridization solution (5X
SSPE, 45% formamide, 5X Denhardt's, 1% SDS) and hybridized overnight at
42°C. The filters were washed at a final stringency of 0.2X SSC/0.1% SDS
at 65°C.

Primary positive clones were picked, diluted and replated on
Hybond N+ filters on LBM plates. Two duplicate filters were rehybridized
with hybridization solution saved from the original hybridization described
supra. Clones which were positive on both filters were picked, grown and
their plasmids isolated and sequenced by standard methods.

Ten FLP-1 positive clones were detected and partial sequence
data from these clones was compared to filamin and FLP-1 sequences derived
from clones S3 and S5. Overlap of sequences from clones S3 and S5 with
sequences from clones F3, F5 and F7 permitted determination of a more
complete sequence for FLP-1, the more complete nucleotide and amino acid
sequences being set out in SEQ ID NOs: 1 and 2, respectively. In SEQ ID NO: 1,
nucleotides 1-315 were derived from clone F5 (clone F5 was significantly
longer than 315 nucleotides); nucleotides 316-738 from clone F3; nucleotides
739-816 from clone F7; nucleotides 817-1122 from clone S3 and nucleotides
1123-2574 from clone S5.
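
As a consistency check on the composite described above, the clone-derived
segments can be tested for gaps and overlaps. The following is a minimal
sketch; the function and variable names are hypothetical, and only the
coordinates are taken from the text.

```python
# (clone, first nucleotide, last nucleotide) in SEQ ID NO: 1, as stated above.
SEGMENTS = [
    ("F5", 1, 315),
    ("F3", 316, 738),
    ("F7", 739, 816),
    ("S3", 817, 1122),
    ("S5", 1123, 2574),
]


def composite_length(segments):
    """Return the total length if the segments tile 1..N without gaps or overlaps."""
    expected_start = 1
    for clone, start, end in segments:
        if start != expected_start or end < start:
            raise ValueError(f"segment from clone {clone} is not contiguous")
        expected_start = end + 1
    return expected_start - 1


TOTAL_NUCLEOTIDES = composite_length(SEGMENTS)
```

The five segments tile the sequence exactly, giving a composite of 2574
nucleotides.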

The longest clone, F5, was later sequenced in its entirety.
There are five differences at the nucleotide level between SEQ ID NO: 1 and
the F5 sequence. In the F5 sequence, nucleotide 437 is C rather than T,
changing amino acid residue 146 from leucine to proline. Nucleotide 1324 is
C rather than G, changing amino acid residue 442 from alanine to proline.
Nucleotide 1642 is G rather than A, thus changing residue
548 from methionine to valine. In addition, nucleotide 2124 is C rather than
T and nucleotide 2181 is A rather than T. The nucleotide differences at
positions 2124 and 2181 do not alter the encoded amino acid residue. The
sequence differences between the composite sequence of SEQ ID NO: 1 and
the corresponding F5 FLP-1 sequence may arise from genetic polymorphism
or the like.
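
The relationship between the nucleotide positions and the residue numbers
quoted above can be checked with simple arithmetic. The sketch below
assumes that the reading frame begins at nucleotide 1 of SEQ ID NO: 1; that
assumption is not stated explicitly in the text, but it reproduces the
three residue numbers given. The function name is hypothetical.

```python
def codon_position(nucleotide, frame_start=1):
    """Map a 1-based nucleotide position to (residue number, position within codon).

    `frame_start` is the assumed first nucleotide of the reading frame.
    """
    offset = nucleotide - frame_start
    if offset < 0:
        raise ValueError("nucleotide lies upstream of the reading frame")
    return offset // 3 + 1, offset % 3 + 1


# Nucleotide positions at which the F5 sequence differs from SEQ ID NO: 1.
F5_DIFFERENCES = [437, 1324, 1642, 2124, 2181]
POSITIONS = {n: codon_position(n) for n in F5_DIFFERENCES}
```

Under this assumption the three amino acid changes fall at the first or
second position of their codons, while nucleotides 2124 and 2181 fall at
the third position, which is consistent with those two differences being
silent.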

Example 3 

Tissue Specific Expression of FLP-1

In order to determine the size of the mRNA encoding FLP-1 in
various tissues, a human immune system multiple tissue northern (Clontech)
was probed with a random-primed portion of clone S3 (corresponding to
nucleotides 255-777 in SEQ ID NO: 9) according to the manufacturer's suggested
protocol. The RNA utilized in the Northern blots included RNA from
appendix, thymus, lymph node, spleen, bone marrow, fetal liver and
peripheral blood leukocytes, and cell lines G361, SW480, K562, HeLa,
HL60, MOLT-4, Raji and A549.

In spleen, lymph node, thymus, bone marrow, and fetal liver,
mRNA of two distinct sizes hybridized to the FLP-1 probe: one just above and
one just below the 9.5 kb size marker. In appendix and peripheral blood
leukocytes, only one band, just below the 9.5 kb size marker, hybridized with
the FLP-1 probe. These results suggest that the FLP-1 mRNA encodes a
protein similar in size to filamin as reported in Gorlin, supra.

To determine whether filamin and FLP-1 are expressed in the
same or in different cell types, Northern blots of mRNA isolated from various
tissues and cell types were probed as described above. An antisense
oligonucleotide filamin probe,
GGTGGCCTTGGTCAGAGAGTCTACAAACAC (SEQ ID NO: 37), and an
antisense oligonucleotide FLP-1 probe,
GGCGCTATAGCAGGTCTCTGTAGACGACCT (SEQ ID NO: 38), were
derived from hinge sequences between repeats 23 and 24 that differ in 23 out
of a total of 30 nucleotides. These oligonucleotides possess approximately
equivalent Tms, 81 and 82°C, respectively. The oligonucleotides were 5'
labelled with 32P and unincorporated nucleotides were removed using a G-25
Sephadex Quickspin column (BMB).
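
The stated difference between the two probes can be confirmed by a
position-by-position comparison of the sequences given above. This is a
minimal sketch with hypothetical names; it simply counts mismatches between
two equal-length strings and does not model hybridization behaviour.

```python
FILAMIN_PROBE = "GGTGGCCTTGGTCAGAGAGTCTACAAACAC"  # SEQ ID NO: 37
FLP1_PROBE = "GGCGCTATAGCAGGTCTCTGTAGACGACCT"     # SEQ ID NO: 38


def count_mismatches(seq_a, seq_b):
    """Number of positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b))


MISMATCHES = count_mismatches(FILAMIN_PROBE, FLP1_PROBE)
```

Both probes are 30 nucleotides long and differ at 23 positions, in
agreement with the figure given in the text.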

The FLP-1 probe was added to the hybridization solution (5X
SSPE, 2X Denhardt's, 0.5% SDS, 100 μg/ml sheared salmon sperm DNA)
and multitissue northerns (Clontech) were hybridized overnight at 42°C.
Filters were washed according to the manufacturer's suggested protocol at a
final stringency of 2X SSC/0.1% SDS at 42°C.

After exposure to film, the filters were stripped according to the 
manufacturer's suggested protocol and exposed to film again to ensure that the 
signal due to the FLP-1 probe had been completely removed. The filters were 
then hybridized with the filamin probe. 

The FLP-1 probe detected two mRNAs, of approximately 
9.5 and 8.5 kb, in several lymphoid and non-lymphoid tissues and cell 
lines. The filamin probe hybridized to a mRNA of approximately 8.5 kb. 
The levels of filamin mRNA detected in appendix, as well as epithelial 
(G361) and myelomonocytic (HL60) cell lines, appear to be markedly 
greater than that of FLP-1 and can be visualized in a 16 hour exposure. In 
contrast, FLP-1 mRNA expression is lower and can be detected only by 
exposing the film for at least seven days. Thus, FLP-1 and filamin mRNA 
are similar in size but appear to be differentially expressed in some tissues 
or cell types. 






WO 97/25423 




PCT/US97/00100 



Example 4 

Specificity of Filamin/β7 and FLP-1/β7 Interaction 

The specificity of the interactions of filamin (clones 1271 
and 514) and FLP-1 with the β7 integrin cytoplasmic tail was verified by 
transforming filamin clone 1271 and FLP-1 clone S5 into Y190 strains 
containing any one of a variety of "bait" vectors (encoding β7 or αv 
integrin cytoplasmic tails) using standard methods described supra. Results 
from this assay, shown in Table 1, indicated that filamin clone 1271 
specifically binds to β7 integrins but not to other integrins and FLP-1 clone 
S5 interacts with β7 integrins. 

Table 1 - Binding Specificity of Filamin and FLP-1 

                     SPECIFICITY OF INTERACTION 
INTEGRIN "BAIT"      FILAMIN      FLP-1 
β7                   +            + 

In vivo interaction between endogenous filamin and β7 
integrin was also investigated by co-precipitation of a filamin/α4β7 complex 
from JY cells, which express endogenous α4β7. Cells were initially 
permeabilized with 10 ng/ml lysolecithin (Sigma, St. Louis, MO) in PBS, 
pH 8.0, with 1 mM Ca++ and 1 mM Mg++, for five minutes. Cellular 
proteins were crosslinked using DTSSP (921 μM) and labeled with biotin 
as described in Akin, et al., Anal. Biochem. 224:382-389 (1995). 
Crosslinked proteins were solubilized using 1% Triton X-100 and integrins 
were immunoprecipitated using monoclonal antibodies immunospecific for 
α4 (antibody HP2/1, Immunotech, Westbrook, ME, or antibody B5G10, 
Upstate Biotechnology, Inc., Lake Placid, NY), or β2 (antibody 23 mb). 
A control antibody, PC21 (Sigma, St. Louis, MO) was also employed. 
Precipitated proteins were separated on a 6% SDS-PAGE gel, transferred 
to an Immobilon P membrane and probed with filamin antisera (Chemicon 
International, Inc. Temecula, CA). 

These results demonstrate co-precipitation of naturally 
occurring filamin with an α4 integrin. Also in this assay, filamin 
co-precipitated with the β2 subunit, but was not precipitated with control 
antibody PC21. This implies that a portion of the filamin molecule not 
encoded by clone 1271 interacts with a β2 integrin. 

Example 5 

Localization of FLP-1 or Filamin Binding on β7 

In order to more fully characterize the binding between FLP-1 
or filamin and the cytoplasmic tail of β7 integrin, the two-hybrid assay 
was employed using various deletion derivatives of either of the individual 
binding partners. 

Several cytoplasmic domain mutants of the β7 integrin were 
created using site directed mutagenesis in order to map the site(s) of 
interaction observed as described above. Filamin truncates (ABPD1, 
ABPD2 and ABPD5) and clones 1271, 514 and 411 and FLP-1 clones S5 
and S3 were employed to evaluate the degree to which mutations in the β7 
cytoplasmic domain affected binding. Following standard 
co-transformations of Y190 as described above, binding interactions were 
determined by β-gal assay, as described above. The β7 deletions utilized in 
these assays are set out in SEQ ID NOS: 14 to 18 and 39-41 below, and 
compared to the native β7 sequence set out in SEQ ID NO: 13. In each 
expression construct, only the cytoplasmic portion of β7, or a truncation 
thereof, was subcloned. 

β7      YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
        (SEQ ID NO: 13) 

β7D1    YRLSVEIYDRREYSRFEKEQQQLNWKQDSNP 
        (SEQ ID NO: 14) 

β7D2    YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSA 
        (SEQ ID NO: 15) 

β7D3    YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINP 
        (SEQ ID NO: 16) 

β7D4    YRLSVEIYDRREYSRFEKE 
        (SEQ ID NO: 17) 

β7D5    YRLSVEIYDRREYSR 
        (SEQ ID NO: 18) 

β7D6    YRLSVEIYDRR 
        (SEQ ID NO: 39) 

β7D8    YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEAD 
        (SEQ ID NO: 40) 

β7D9    YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRF 
        (SEQ ID NO: 41) 
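
As an illustrative check only, each deletion listed above can be confirmed to be a carboxy terminal truncation of the native β7 cytoplasmic tail, and the number of residues removed can be tabulated. The sketch below uses the sequences of SEQ ID NOs: 13 to 18 and 39 to 41 as they read after correction of evident transcription errors; the variable and function names are hypothetical.

```python
# Native beta-7 cytoplasmic tail (SEQ ID NO: 13) and its deletion derivatives.
NATIVE_B7_TAIL = "YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL"

DELETIONS = {
    "D1": "YRLSVEIYDRREYSRFEKEQQQLNWKQDSNP",
    "D2": "YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSA",
    "D3": "YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINP",
    "D4": "YRLSVEIYDRREYSRFEKE",
    "D5": "YRLSVEIYDRREYSR",
    "D6": "YRLSVEIYDRR",
    "D8": "YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEAD",
    "D9": "YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRF",
}


def is_c_terminal_truncation(native: str, mutant: str) -> bool:
    """True if the mutant is a proper prefix of the native sequence."""
    return len(mutant) < len(native) and native.startswith(mutant)


# Print the mutants from shortest to longest retained tail.
for name, seq in sorted(DELETIONS.items(), key=lambda item: len(item[1])):
    removed = len(NATIVE_B7_TAIL) - len(seq)
    ok = is_c_terminal_truncation(NATIVE_B7_TAIL, seq)
    print(f"b7{name}: {len(seq):2d} residues retained, {removed:2d} removed, "
          f"C-terminal truncation: {ok}")
```

Sorting by retained length makes it easy to see which mutants bracket a given stretch of the tail when the binding results are read.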

Primers used to generate the various deletion mutants are set 
out in SEQ ID NOs: 19 to 23 and 42-46, below, and were individually 
paired in an amplification reaction with the primer set out in SEQ ID 
NO: 3. Reaction conditions were as described in Example 1. Deletions 
β7D8 and β7D9 were prepared using QuikChange site directed mutagenesis 
(Stratagene, La Jolla, CA) and all other deletions were prepared by 
standard single stranded site directed mutagenesis. 

NHβ7D1  GATGGCACTTTTGTACTAAGGATTACTGTCCTG (SEQ ID NO: 19) 
NHβ7D2  ATTGATGGTGGTCGTCTAGGCACTTTTGTAGAG (SEQ ID NO: 20) 
NHβ7D3  GTCTGCCTCTTGAAACTAAGGATTGATGGTGGT (SEQ ID NO: 21) 
NHβ7D4  CCAGTTGAGTTGTTGCTACTCCTTCTCAAAGCG (SEQ ID NO: 22) 
NHβ7D5  GTTGCTGCTCCTTCTCCTAGCGACTGTATTCCCG (SEQ ID NO: 23) 
β7D6:   CTCAAAGCGACTGTACTACCGGCGGTCATAGATTTC (SEQ ID NO: 42) 
β7D8:   CTTTCAAGAGGCAGACTGACCCACTCTCTGAGGA (sense oligo) 
        (SEQ ID NO: 43) 
β7D8:   TCCTCAGAGAGTGGGTCAGTCTGCCTCTTGAAAG (antisense oligo) 
        (SEQ ID NO: 44) 
β7D9:   CATCAATCCTCGCTTTTGAGAGGCAGACAGTCCC (sense oligo) 
        (SEQ ID NO: 45) 
β7D9:   GGGACTGTCTGCCTCTCAAAAGCGAGGATTGATC (antisense oligo) 
        (SEQ ID NO: 46) 
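
Mutagenesis of the kind used for β7D8 employs a pair of complementary oligonucleotides. As a hedged illustration, the sketch below checks that the β7D8 sense and antisense oligonucleotides (SEQ ID NOs: 43 and 44) are exact reverse complements and that the sense strand carries a TGA codon immediately after a GAC (Asp) codon, which would be consistent with β7D8 ending in ...FQEAD. The helper name is hypothetical, and the reading frame is inferred from the listed peptide rather than stated in the text.

```python
# Complement table for unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


D8_SENSE = "CTTTCAAGAGGCAGACTGACCCACTCTCTGAGGA"      # SEQ ID NO: 43
D8_ANTISENSE = "TCCTCAGAGAGTGGGTCAGTCTGCCTCTTGAAAG"  # SEQ ID NO: 44

pair_is_complementary = reverse_complement(D8_SENSE) == D8_ANTISENSE
has_asp_then_stop = "GACTGA" in D8_SENSE  # GAC (Asp) followed by TGA (stop)

print("D8 oligos are reverse complements:", pair_is_complementary)
print("Sense oligo contains GAC-TGA:", has_asp_then_stop)
```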

In addition, a series of β7 substitution mutants was also 
constructed wherein the sequence changes are set out in SEQ ID NOs: 24 
to 27 and 47-51, with the substituted amino acid residue underlined. 



β7S3A 
YRLAVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 24) 

β7E5Q 
YRLSVQIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 25) 

β7R9A 
YRLSVEIYDAREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 26) 

β7S13A 
YRLSVEIYDRREYARFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 27) 

β7V4F 
YRLSFEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 47) 

β7I6F 
YRLSVEFYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 48) 

β7Y7F 
YRLSVEIFDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 49) 

β7D8A 
YRLSVEIYARREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 50) 

β7R10A 
YRLSVEIYDRAEYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL 
(SEQ ID NO: 51) 
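
The substitution names can be related to the listed sequences programmatically. The following is a minimal sketch which assumes, based only on inspection of the sequences above, that residue numbering in the mutant names begins at the residue following the initial tyrosine (so that "S3A" alters the fourth residue of the listed peptide). That offset and the function name are assumptions for illustration, not statements of the numbering convention actually used.

```python
NATIVE_B7_TAIL = "YRLSVEIYDRREYSRFEKEQQQLNWKQDSNPLYKSAITTTINPRFQEADSPTL"


def apply_substitution(native: str, name: str, offset: int = 1) -> str:
    """Apply a substitution such as 'D8A' (old residue, position, new residue).

    `offset` is the assumed number of residues preceding position 1 in the
    listed peptide; offset=1 reflects the apparent numbering described above.
    """
    old, position, new = name[0], int(name[1:-1]), name[-1]
    index = position - 1 + offset  # convert to a 0-based string index
    if native[index] != old:
        raise ValueError(f"{name}: expected {old} at index {index}, "
                         f"found {native[index]}")
    return native[:index] + new + native[index + 1:]


for mutant in ["S3A", "V4F", "E5Q", "I6F", "Y7F", "D8A", "R9A", "R10A", "S13A"]:
    print(f"b7{mutant:5s} {apply_substitution(NATIVE_B7_TAIL, mutant)}")
```

The residue check in `apply_substitution` raises an error if a name and the native sequence disagree, which is a convenient way to catch a transcription error in either.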

Oligonucleotides used to generate the various substitution 
variants are set out in SEQ ID NOs: 28 to 31 and 52 to 56, infra. 

B7S3A   GTCATAGATTTCCACCGCGAGCCGGTATCCGAG (SEQ ID NO: 28) 
B7E5Q   CCGGCCGTCATAGATTTGCACCGAGAGCCGGTATC (SEQ ID NO: 29) 
B7R9A   GCGACTGTATTCCCGCGCGTCATAGATTTCCAC (SEQ ID NO: 30) 
B7S13A  CTCCTTCTCAAAGCGCGCGTATTCCCGGCGGTC (SEQ ID NO: 31) 
β7V4F   GCGGTCATAGATTTCAAACGAGAGCCGGTATCC (SEQ ID NO: 52) 
β7I6F   TTCCCGGCGGTCATAGAATTCCACCGAGAGCCG (SEQ ID NO: 53) 
β7Y7F   GTATTCCCGGCGGTCAAAGATTTCCACCGAGAG (SEQ ID NO: 54) 
β7D8A   ACTGTATTCCCGGCGCGCATAGATTTCCACCGA (SEQ ID NO: 55) 
β7R10A  AAAGCGACTGTATTCCGCGCGGTCATAGATTTC (SEQ ID NO: 56) 

Specific truncation mutants of filamin were generated by 
PCR amplification of existing clones under conditions described in Example 
1. Mutant ABPD1 encoded a region including a portion of repeat 23, the 
second hinge region and repeat 24 of filamin (amino acids 2487-2647 in 
SEQ ID NO: 7). Mutant ABPD2 (amino acids 2487-2577 in SEQ ID NO: 
7) encoded a truncated form of ABPD1 which lacked the filamin 
dimerization domain. Mutant ABPD4 (amino acids 2517-2647 in SEQ ID 
NO: 7) encoded a truncated form of ABPD1 which lacked the twenty-third 
repeat. Mutant ABPD5 (amino acids 2198-2435 in SEQ ID NO: 7) encoded 
a truncated form of clone 411 which lacked most of repeat 23, the second 
hinge region and repeat 24. Mutant ABPD9 (amino acids 2350-2435 in 
SEQ ID NO: 7) encoded a truncated form of ABPD5. Mutant ABPD10 
(amino acids 2256-2363 in SEQ ID NO: 7) encoded another truncated form 
of ABPD5. 
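
For orientation, the residue ranges recited above can be tabulated and the stated parent/truncate relationships verified. The sketch below is illustrative only; it treats each mutant as an inclusive residue range in SEQ ID NO: 7, and the names `TRUNCATIONS`, `length` and `contains` are hypothetical.

```python
# Inclusive amino acid ranges in SEQ ID NO: 7, as recited in the text.
TRUNCATIONS = {
    "ABPD1": (2487, 2647),
    "ABPD2": (2487, 2577),
    "ABPD4": (2517, 2647),
    "ABPD5": (2198, 2435),
    "ABPD9": (2350, 2435),
    "ABPD10": (2256, 2363),
}

# Truncate -> parent relationships stated in the text.
PARENTS = {"ABPD2": "ABPD1", "ABPD4": "ABPD1", "ABPD9": "ABPD5", "ABPD10": "ABPD5"}


def length(span):
    """Number of residues in an inclusive (start, end) range."""
    return span[1] - span[0] + 1


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


for name, span in TRUNCATIONS.items():
    parent = PARENTS.get(name)
    note = ""
    if parent:
        note = f", within {parent}: {contains(TRUNCATIONS[parent], span)}"
    print(f"{name}: residues {span[0]}-{span[1]} ({length(span)} aa){note}")
```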

Mutant ABPD1 was generated by PCR using primers set out 
in SEQ ID NO: 32 and 33, and mutant ABPD2 was generated by PCR 
using primers set out in SEQ ID NO: 32 and 34. 

ABP.5x    ATATCTCGAGAGTATACCCCCATGGCACCT (SEQ ID NO: 32) 
ABP.Xho1  ATATCTCGAGTCAGGGCACCACAACGCG (SEQ ID NO: 33) 
ABP.Xho2  ATATCTCGAGTCAGCTGCTCTTCTGGCCCTAC (SEQ ID NO: 34) 
The primers of SEQ ID NOs: 32-34 were used in reactions with filamin clone 
1271 under the following amplification conditions: an initial incubation at 
94°C for five minutes, followed by thirty cycles of 94°C for 30 seconds, 
50°C for 30 seconds, and 72°C for one minute. The resulting PCR 
product was cut with XhoI and ligated into vector pACT (described in 
Example 1) previously digested with XhoI. 
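
Because the PCR products are cloned through XhoI sites carried on the primers, it can be useful to confirm where the XhoI recognition sequence (CTCGAG) falls in each primer. The sketch below is one way to do so; the primer names follow the listing for SEQ ID NOs: 32 to 34 and the function name is hypothetical.

```python
XHOI_SITE = "CTCGAG"  # XhoI recognition sequence

PRIMERS = {
    "ABP.5x": "ATATCTCGAGAGTATACCCCCATGGCACCT",      # SEQ ID NO: 32
    "ABP.Xho1": "ATATCTCGAGTCAGGGCACCACAACGCG",      # SEQ ID NO: 33
    "ABP.Xho2": "ATATCTCGAGTCAGCTGCTCTTCTGGCCCTAC",  # SEQ ID NO: 34
}


def site_positions(seq: str, site: str) -> list:
    """Return every 0-based start index of `site` in `seq`, overlaps included."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


for name, seq in PRIMERS.items():
    print(f"{name}: XhoI site at index {site_positions(seq, XHOI_SITE)}")
```

In each of these primers the site follows a four-base 5' extension (ATAT), which leaves flanking bases on the amplified product for the enzyme to act on.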

Mutants ABPD4, ABPD5, ABPD9 and ABPD10 were 
generated by PCR. ABPD4 was generated using primers 1271/151 and 
1271/3XR and used clone 1271 as the DNA template. ABPD5 was 
generated using primers B7411/1X and B7411/700X and used clone 411 as 
the DNA template. ABPD9 was generated using primers B7411/457X and 
B7411/700X and used clone 411 as the DNA template. ABPD10 was 
generated using primers B7411/175X and B7411/498X and used clone 411 
as the DNA template. 

1271/151    CCCGAATTCACAGGCCCCCGTCTCGTC (SEQ ID NO: 57) 
1271/3XR    CCCGAATTCCTCGAGTCAGGGCACCACAACGCGGTAG (SEQ ID NO: 58) 
B7411/1X    CCCCCTCGAGGCTACTGCATCCGCTTTGTTC (SEQ ID NO: 59) 
B7411/700X  CCCCTCGAGTCAGTAAGCAGACACCAAGCC (SEQ ID NO: 60) 
B7411/457X  CCCCTCGAGCCAGCCTCTTTTGCAGTC (SEQ ID NO: 61) 
B7411/175X  CCCCTCGAGCCAGCCGAATTCAGTATC (SEQ ID NO: 62) 
B7411/498X  CCCCTCGAGTCACGCCCCCTTGGCCCCCTTC (SEQ ID NO: 63) 

Primers as described were used in PCR reactions with the 
appropriate templates under amplification conditions outlined in Example 1. 
The resulting PCR products were cut with XhoI (ABPD5, ABPD9 and 
ABPD10) or EcoRI (ABPD4) and ligated into vector pACT (ABPD5) or 
vector pACT2 (ABPD9 and ABPD10) previously digested with XhoI or 
ligated into vector pGAD10 (ABPD4) previously digested with EcoRI. The 
resulting subclones were sequenced to rule out PCR derived errors. 




An FLP-1 mutant comprised of amino acid sequences 696 to 
857 in SEQ ID NO: 1 and showing identity to TABP (the TABP-like 
analog) was also generated by PCR amplification (under conditions 
described in Example 1) from a human spleen cDNA library. The FLP-1 
mutant was generated by PCR using the primer pair set out in SEQ ID NO: 
35 and 36. 

TABP.Nde  ATATCATATGTACACCCCCATGGCTCCT (SEQ ID NO: 35) 
TABP.Bam  ATAGGATCCTCAGCCCCACAAACAGGC (SEQ ID NO: 36) 

Reactions were carried out using 2.5 μg spleen cDNA under the following 
amplification conditions: an initial incubation at 94°C for five minutes, 
followed by thirty cycles of 94°C for 30 seconds, 50°C for 30 seconds, 
and 72°C for one minute. The resulting PCR products were digested with 
NdeI and BamHI and cloned into vector pET previously digested with the 
same enzymes. The resulting TABP/pET vector was then utilized in a 
secondary PCR with the PCR primer pair set out in SEQ ID NO: 32 and 
33, above, under the following conditions: an initial incubation at 94°C 
for five minutes, followed by thirty cycles at 94°C for one minute, 50°C for 
one minute and 72°C for two minutes. The resulting PCR product was 
digested with XhoI and cloned into pACT previously digested with XhoI. 
The FLP-1 TABP-like truncate represents the same size and region in 
filamin as represented by mutant ABPD1. 
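
The two thermal cycling programs recited above differ only in step duration. As a simple worked example, the sketch below records each program as data and computes the total programmed hold time, ignoring temperature ramping between steps (an assumption, since ramp rates are instrument dependent and are not given). The class and field names are hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class PcrProgram:
    """A cycling program: one initial hold, then repeated multi-step cycles."""
    initial_hold_s: int
    cycles: int
    step_seconds: Tuple[int, ...]  # denaturation, annealing, extension

    def hold_time_minutes(self) -> float:
        """Total programmed hold time in minutes, excluding ramp time."""
        total_s = self.initial_hold_s + self.cycles * sum(self.step_seconds)
        return total_s / 60


# 94 C for 5 min, then 30 cycles of 94 C/30 s, 50 C/30 s, 72 C/1 min.
PRIMARY = PcrProgram(initial_hold_s=300, cycles=30, step_seconds=(30, 30, 60))
# 94 C for 5 min, then 30 cycles of 94 C/1 min, 50 C/1 min, 72 C/2 min.
SECONDARY = PcrProgram(initial_hold_s=300, cycles=30, step_seconds=(60, 60, 120))

print(f"Primary PCR:   {PRIMARY.hold_time_minutes():.0f} min")
print(f"Secondary PCR: {SECONDARY.hold_time_minutes():.0f} min")
```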

Another FLP-1 mutant, FLP1D3, was generated by 
PCR. FLP1D3 encoded a truncated form of clone S5 (amino acids 272-483 
in SEQ ID NO: 11). FLP1D3 represents the carboxy terminal region of S5, 
which is not encoded by S3. The primers used to generate this mutant 
were B7S5/814X and B7S5/1475X and clone S5 was used as the DNA 
template. 
B7S5/814X   CCCCCTCGAGGCGGCACGGGACTCGAAGGG (SEQ ID NO: 64) 
B7S5/1475X  CCCCTCGAGTTAAGGCACTGTGACATG (SEQ ID NO: 65) 

These primers were utilized in PCR reactions under 
amplification conditions outlined in Example 1. The resulting PCR 
products were digested with XhoI and ligated into the vector pACT 
previously digested with XhoI. Sequencing of the resulting subclones ruled 
out PCR derived errors. 

Results from the two hybrid assays as shown in Tables 2-4, 
discussed below, indicate that there are two distinct regions of filamin 
capable of interacting with the β7 cytoplasmic tail. The first region is 
represented by clones 514 and 1271, the second region by deletion mutant 
ABPD5. In the first binding region of filamin (as represented by clones 
514 and 1271) the dimerization domain (amino acids 2578-2647 of SEQ ID 
NO: 7) (not present in ABPD2) and the region at the 5' end of repeat 23 
(not present in ABPD4) appear to be critical for interaction with the β7 
cytoplasmic tail. The results with TABP and FLP1D3 also show that 
despite a high degree of homology with filamin, there does not appear to 
be a corresponding region in FLP-1 similar to the "repeat 23-24" region 
found in filamin clones 514 and 1271 which is capable of interacting with 
the β7 cytoplasmic tail. In addition, interaction with ABPD5 indicates a 
second region of filamin, centered around repeat 21, which interacts with the 
β7 cytoplasmic tail. This second region corresponds to a region of FLP-1 
(amino acids 1-273 of SEQ ID NO: 12) that is most likely to be responsible 
for the FLP-1 interaction with the β7 cytoplasmic tail. 
TABLE 2 - INTERACTION OF FILAMIN (1271) 
AND FLP-1 (S5) WITH β7 DELETION AND 
SUBSTITUTION ANALOGS 
TABLE 3 - INTERACTION OF β7 WITH FILAMIN 
AND FLP-1 AND DELETION MUTANTS 

FILAMIN 
    1271        + 
    514 
    411 
    ABPD1       +/- 
    ABPD2 
    ABPD4 
    ABPD5       + 
    ABPD9 
    ABPD10 
FLP-1 
    S3          + 
    S5          + 
    TABP 
    FLP1D3 


Table 2 and Table 4 below summarize the effect of β7 deletion 
mutants and substitution analogs on binding of β7 to filamin (clone 514), 
ABPD1, ABPD2, ABPD5, TABP and FLP-1 clones S3 and S5. The 
binding properties of the first filamin binding site, represented by clones 
514 and 1271, are affected by substitutions in the membrane proximal region 
of the β7 cytoplasmic tail. Specifically, substitution mutant E5Q greatly 
weakens the interaction with clones 514 and 1271. Substitution mutant 
D8A completely disrupts the interaction of β7 with clones 514 and 1271 
(Table 4). The binding of ABPD5 to β7 (the second region of filamin 
which interacts with the β7 cytoplasmic tail) is not affected by substitutions 
in the membrane proximal region of the β7 cytoplasmic tail, as shown by 
substitution mutants E5Q and D8A. However, the binding of ABPD5 to β7 
is decreased in deletion mutants at the carboxy terminus of the β7 
cytoplasmic tail, as shown by deletion mutants β7D1 and β7D2 (Table 4). 
FLP-1 clones S3 and S5 demonstrate a pattern of interaction with the β7 
deletion and substitution mutants that is remarkably similar to ABPD5. 
Because ABPD5, S3 and S5 were able to interact with deletion mutants 
smaller than β7D1 and β7D2, such as β7D6, it is possible that this region of 
filamin or FLP-1 can interact with more than one region of the β7 
cytoplasmic tail. 



TABLE 4 - INTERACTION OF 
FILAMIN (514) WITH β7 DELETION 
AND SUBSTITUTION ANALOGS 
The data presented in this Example demonstrate that there 
are two distinct regions of filamin which interact with two distinct regions 
of the β7 cytoplasmic tail. They also show that the region of FLP-1 which 
interacts with the β7 cytoplasmic tail is similar to the ABPD5 region of 
filamin in its interaction characteristics with the β7 cytoplasmic tail. 
Example 6 

Applications for Modulators of Filamin/β7 and FLP-1/β7 Binding 



Two β7 associated integrins have been identified: α4β7 and 
αEβ7. Both are expressed on a subpopulation of peripheral blood 
lymphocytes and their expression is inducible. Both are expressed on 
macrophages but not monocytes and both appear to function in homing or 
localization of lymphocytes to mucosal tissue [see review in Jutila, J. 
Leukocyte Biol. 55:133-140 (1994)]. The homing properties of α4β7 can be 
attributed to interaction with MAdCAM-1 expressed in mucosal nodes, 
while the retention of αEβ7 cells in the gut is attributed to interactions with 
epithelial cells expressing E-cadherin. Thus, binding by one or both β7 
integrins to their respective counter-receptor may contribute to mucosal 
immune responses as well as inflammatory (e.g., inflammatory bowel 
disease, IBD) and autoimmune responses at this site. 

Further, it has been suggested that filamin is important in 
cell locomotion due to the fact that cells expressing low levels of the 
protein do not form leading lamella structures required for locomotion. 
The structural homology of FLP-1 to filamin suggests a similar role for this 
protein. In view of the observation that integrins can be observed clustered 
in point contacts, which are also important in cell locomotion, the invention 
contemplates that β7 interaction with FLP-1 and/or filamin may be crucial 
to cell movement, and that disruption of the interactions will be useful, for 
example, in preventing the homing of β7+ cells which occurs in certain 
pathological inflammatory responses such as IBD. 

In order to identify modulators of β7/FLP-1 interaction, it is 
necessary to clearly define the portions of both proteins which are 
necessary for binding. Amino acid substitution through standard 
mutagenesis techniques will permit identification of the binding regions of 
the proteins. Deletion analysis, wherein truncated forms of either protein 
are generated, for example by PCR, is also useful for identification of 
binding regions if the deletion does not disrupt the tertiary or quaternary 
structure of the protein to the point that it is no longer recognized by its 
counter-receptor. 

Identification of the significant protein regions involved in 
binding permits more accurate and efficient screening of putative 
modulators of binding activity. The invention contemplates a high 
throughput screening (HTS) assay to analyze large libraries of small molecules or 
peptides, as well as antibodies immunospecific for either or both binding 
partners, for the ability to modulate binding of β7 integrins to FLP-1 or 
filamin. While two hybrid screening, scintillation proximity assays (SPA) 
and immunological methodologies, for example, enzyme-linked 
immunosorbent assays (ELISA), disclosed herein are not HTS methods per 
se, they are amenable to testing many of the compounds listed for an ability to 
modulate binding. SPA and ELISA are particularly useful in this 
identification process and can be modified to permit high throughput 
screening of the test compounds described. 
SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

    (i) APPLICANT: ICOS CORPORATION 

    (ii) TITLE OF INVENTION: Cytoplasmic Modulators of Integrin Binding 

    (iii) NUMBER OF SEQUENCES: 65 

    (iv) CORRESPONDENCE ADDRESS: 
        (A) ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun 
        (B) STREET: 233 South Wacker Drive, 6300 Sears Tower 
        (C) CITY: Chicago 
        (D) STATE: Illinois 
        (E) COUNTRY: United States of America 
        (F) ZIP: 60606 

    (v) COMPUTER READABLE FORM: 
        (A) MEDIUM TYPE: Floppy disk 
        (B) COMPUTER: IBM PC compatible 
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS 
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

    (vi) CURRENT APPLICATION DATA: 
        (A) APPLICATION NUMBER: 
        (B) FILING DATE: 
        (C) CLASSIFICATION: 

    (viii) ATTORNEY/AGENT INFORMATION: 
        (A) NAME: Greta E. Noland 
        (B) REGISTRATION NUMBER: 35,302 
        (C) REFERENCE/DOCKET NUMBER: 27866/33773 

    (ix) TELECOMMUNICATION INFORMATION: 
        (A) TELEPHONE: 312-474-6300 
        (B) TELEFAX: 312-474-0448 

(2) INFORMATION FOR SEQ ID NO: 1: 

    (i) SEQUENCE CHARACTERISTICS: 
        (A) LENGTH: 2574 base pairs 
        (B) TYPE: nucleic acid 
        (C) STRANDEDNESS: single 
        (D) TOPOLOGY: linear 

    (ii) MOLECULE TYPE: cDNA 

    (ix) FEATURE: 
        (A) NAME/KEY: CDS 
        (B) LOCATION: 1..2574 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CCT TTT GAC CTG GTC ATT CCG TTT GCT GTC AGG AAA GGA GAA ATC ACT 48 
Pro Phe Asp Leu Val Ile Pro Phe Ala Val Arg Lys Gly Glu Ile Thr 
1 5 10 15 

GGA GAG GTC CAC ATG CCT TCT GGG AAG ACA GCC ACA CCT GAG ATT GTG 96 
Gly Glu Val His Met Pro Ser Gly Lys Thr Ala Thr Pro Glu Ile Val 
20 25 30 

GAG AAC AAG GAC GGC ACG GTC ACT GTT AGA TAT GCC CCC ACT GAG GTC 144 
Asp Asn Lys Asp Gly Thr Val Thr Val Arg Tyr Ala Pro Thr Glu Val 
35 40 45 

GGG CTC CAT GAG ATG CAC ATC AAA TAC ATG GGC AGC CAC ATC CCT GAG 192 
Gly Leu His Glu Met His Ile Lys Tyr Met Gly Ser His Ile Pro Glu 
50 55 60 

AGC CCA CTC CAG TTC TAC GTG AAC TAC CCC AAC AGT GGA AGT GTT TCT 240 
Ser Pro Leu Gln Phe Tyr Val Asn Tyr Pro Asn Ser Gly Ser Val Ser 
65 70 75 80 

GCA TAC GGT CCA GGC CTC GTG TAT GGA GTG GCC AAC AAA ACT GCC ACC 288 
Ala Tyr Gly Pro Gly Leu Val Tyr Gly Val Ala Asn Lys Thr Ala Thr 
85 90 95 

TTC ACC ATC GTC ACA GAG GAT GCA GGA GAA GGT GGT CTG GAC TTG GCT 336 
Phe Thr Ile Val Thr Glu Asp Ala Gly Glu Gly Gly Leu Asp Leu Ala 
100 105 110 

ATT GAG GGC CCC TCA AAA GCA GAA ATC AGC TGC ATT GAC AAT AAA GAT 384 
Ile Glu Gly Pro Ser Lys Ala Glu Ile Ser Cys Ile Asp Asn Lys Asp 
115 120 125 

GGG ACA TGC ACA GTG ACC TAC CTG CCG ACT CTG CCA GGC GAC TAC AGC 432 
Gly Thr Cys Thr Val Thr Tyr Leu Pro Thr Leu Pro Gly Asp Tyr Ser 
130 135 140 

ATT CTG GTC AAG TAC AAT GAC AAG CAC ATC CCT GGC AGC CCC TTC ACA 480 
Ile Leu Val Lys Tyr Asn Asp Lys His Ile Pro Gly Ser Pro Phe Thr 
145 150 155 160 

GCC AAG ATC ACA GAT GAC AGC AGG CGG TGC TCC CAG GTG AAG TTG GGC 528 
Ala Lys Ile Thr Asp Asp Ser Arg Arg Cys Ser Gln Val Lys Leu Gly 
165 170 175 

TCA GCC GCT GAC TTC CTG CTC GAC ATC AGT GAG ACT GAC CTC AGC AGC 576 
Ser Ala Ala Asp Phe Leu Leu Asp Ile Ser Glu Thr Asp Leu Ser Ser 
180 185 190 

CTG ACG GCC AGC ATT AAG GCC CCA TCT GGC CGA GAC GAG CCC TGT CTC 624 
Leu Thr Ala Ser Ile Lys Ala Pro Ser Gly Arg Asp Glu Pro Cys Leu 
195 200 205 

CTG AAG AGG CTG CCC AAC AAC CAC ATT GGC ATC TCC TTC ATC CCC CGG 672 
Leu Lys Arg Leu Pro Asn Asn His Ile Gly Ile Ser Phe Ile Pro Arg 
210 215 220 

GAA GTG GGC GAA CAT CTG GTC AGC ATC AAG AAA AAT GGC AAC CAT GTG 720 
Glu Val Gly Glu His Leu Val Ser Ile Lys Lys Asn Gly Asn His Val 
225 230 235 240 

GCC AAC AGC CCC GTG TCT ATC ATG GTG GTC CAG TCG GAG ATT GGT GAC 768 
Ala Asn Ser Pro Val Ser Ile Met Val Val Gln Ser Glu Ile Gly Asp 
245 250 255 

GCC CGC CGA GCC AAA GTC TAT GGC CGC GGC CTG TCA GAA GGC CGG ACT 816 
Ala Arg Arg Ala Lys Val Tyr Gly Arg Gly Leu Ser Glu Gly Arg Thr 
260 265 270 

TTC GAG ATG TCT GAC TTC ATC GTG GAC ACA AGG GAT GCA GGT TAT GGT 864 
Phe Glu Met Ser Asp Phe Ile Val Asp Thr Arg Asp Ala Gly Tyr Gly 
275 280 285 

GGC ATA TCC TTG GCG GTG GAA GGC CCC AGC AAA GTG GAC ATC CAG ACG 912 
Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Lys Val Asp Ile Gln Thr 
290 295 300 

GAG GAC CTG GAA GAT GGC ACC TGC AAA GTC TCC TAC TTC CCT ACC GTG 960 
Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Ser Tyr Phe Pro Thr Val 
305 310 315 320 

CCT GGG GTT TAT ATC GTC TCC ACC AAA TTC GCT GAC GAG CAC GTG CCT 1008 
Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Ala Asp Glu His Val Pro 
325 330 335 

GGG AGC CCA TTT ACC GTG AAG ATC AGT GGG GAG GGA AGA GTC AAA GAG 1056 
Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Glu Gly Arg Val Lys Glu 
340 345 350 

AGC ATC ACC CGC ACC AGT CGG GCC CCG TCC GTG GCC ACT GTC GGG AGC 1104 
Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Val Ala Thr Val Gly Ser 
355 360 365 

ATT TGT GAC CTG AAC CTG AAA ATC CCA GAA ATC AAC AGC AGT GAT ATG 1152 
Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Ile Asn Ser Ser Asp Met 
370 375 380 

TCG GCC CAC GTC ACC AGC CCC TCT GGC CGT GTG ACT GAG GCA GAG ATT 1200 
Ser Ala His Val Thr Ser Pro Ser Gly Arg Val Thr Glu Ala Glu Ile 
385 390 395 400 

GTG CCC ATG GGG AAG AAC TCA CAC TGC GTC CGG TTT GTG CCC CAG GAG 1248 
Val Pro Met Gly Lys Asn Ser His Cys Val Arg Phe Val Pro Gln Glu 
405 410 415 

ATG GGC GTG CAC ACG GTC AGC GTC AAG TAC CGT GGG CAG CAC GTC ACC 1296 
Met Gly Val His Thr Val Ser Val Lys Tyr Arg Gly Gln His Val Thr 
420 425 430 

GGC AGC CCC TTC CAG TTC ACC GTG GGG GCA CTT GGT GAA GGA GGC GCC 1344 
Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Leu Gly Glu Gly Gly Ala 
435 440 445 

CAC AAG GTG CGG GCA GGA GGC CCT GGC CTG GAG AGA GGA GAA GCG GGA 1392 
His Lys Val Arg Ala Gly Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly 
450 455 460 

GTC CCA GCT GAG TTC AGC ATT TGG ACC CGG GAA GCA GGC GCT GGA GGC 1440 
Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly 
465 470 475 480 

CTC TCC ATC GCT GTT GAG GGC CCC AGT AAG GCC GAG ATT ACA TTC GAT 1488 
Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp 
485 490 495 

GAC CAT AAA AAT GGG TCG TGC GGT GTA TCT TAT ATT GCC CAA GAG CCT 1536 
Asp His Lys Asn Gly Ser Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro 
500 505 510 

GGT AAC TAC GAG GTG TCC ATC AAG TTC AAT GAT GAG CAC ATC CCG GAA 1584 
Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn Asp Glu His Ile Pro Glu 
515 520 525 

AGC CCC TAC CTG GTG CCG GTC ATC GCA CCC TCC GAC GAC GCC CGC CGC 1632 
Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Ser Asp Asp Ala Arg Arg 
530 535 540 

CTC ACT GTT ATG AGC CTT CAG GAA TCG GGA TTA AAA GTT AAC CAG CCA 1680 
Leu Thr Val Met Ser Leu Gln Glu Ser Gly Leu Lys Val Asn Gln Pro 
545 550 555 560 

GCA TCC TTT GCT ATA AGG TTG AAT GGC GCA AAA GGC AAG ATT GAT GCA 1728 
Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala 
565 570 575 

AAG GTG CAC AGC CGC TCT GGA GCC GTG GAG GAG TGC CAC GTG TCT GAG 1776 
Lys Val His Ser Pro Ser Gly Ala Val Glu Glu Cys His Val Ser Glu 
580 585 590 

CTG GAG CCA GAT AAG TAT GCT GTT CGC TTC ATC CCT CAT GAG AAT GGT 1824 
Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Ile Pro His Glu Asn Gly 
595 600 605 

GTC CAC ACC ATC GAT GTC AAG TTC AAT GGG AGC CAC GTG GTT GGA AGC 1872 
Val His Thr Ile Asp Val Lys Phe Asn Gly Ser His Val Val Gly Ser 
610 615 620 

CCC TTC AAA GTG CGC GTT GGG GAG CCT GGA CAA GCG GGG AAC CCT GCC 1920 
Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala 
625 630 635 640 

CTG GTG TCC GCC TAT GGC ACG GGA CTC GAA GGG GGN ACC ACA GGT ATC 1968 
Leu Val Ser Ala Tyr Gly Thr Gly Leu Glu Gly Xaa Thr Thr Gly Ile 
645 650 655 

CAG TCG GAA TTC TTT ATT AAC ACC ACC CGA GCA GGT CCA GGG ACA TTA 2016 
Gln Ser Glu Phe Phe Ile Asn Thr Thr Arg Ala Gly Pro Gly Thr Leu 
660 665 670 

TCC GTC ACC ATC GAA GGC CCA TCC AAG GTT AAA ATG GAT TGC CAG GAA 2064 
Ser Val Thr Ile Glu Gly Pro Ser Lys Val Lys Met Asp Cys Gln Glu 
675 680 685 

ACA CCT GAA GGG TAC AAA GTC ATG TAC ACC CCC ATG GCT CCT GGT AAC 2112 
Thr Pro Glu Gly Tyr Lys Val Met Tyr Thr Pro Met Ala Pro Gly Asn 
690 695 700 

TAC CTG ATC AGT GTC AAA TAC GGT GGG CCC AAC CAC ATC GTG GGC AGT 2160 
Tyr Leu Ile Ser Val Lys Tyr Gly Gly Pro Asn His Ile Val Gly Ser 
705 710 715 720 

CCC TTC AAG GCC AAG GTG ACT GGC CAG CGT CTA GTT AGC CCT GGC TCA 2208 
Pro Phe Lys Ala Lys Val Thr Gly Gln Arg Leu Val Ser Pro Gly Ser 
725 730 735 

GCC AAC GAG ACC TCA TCC ATC CTG GTG GAG TCA GTG ACC AGG TCG TCT 2256 
Ala Asn Glu Thr Ser Ser Ile Leu Val Glu Ser Val Thr Arg Ser Ser 
740 745 750 

ACA GAG ACC TGC TAT AGC GCC ATT CCC AAG GCA TCC TCG GAC GCC AGC 2304 
Thr Glu Thr Cys Tyr Ser Ala Ile Pro Lys Ala Ser Ser Asp Ala Ser 
755 760 765 

AAG GTG ACC TCT AAG GGG GCA GGG CTC TCA AAG GCC TTT GTG GGC CAG 2352 
Lys Val Thr Ser Lys Gly Ala Gly Leu Ser Lys Ala Phe Val Gly Gln 
770 775 780 

AAG AGT TCC TTC CTG GTG GAC TGC AGC AAA GCT GGC TCC AAC ATG CTG 2400 
Lys Ser Ser Phe Leu Val Asp Cys Ser Lys Ala Gly Ser Asn Met Leu 
785 790 795 800 

CTG ATC GGG GTC CAT GGG CCC ACC ACC CCC TGC GAG GAG GTC TCC ATG 2448 
Leu Ile Gly Val His Gly Pro Thr Thr Pro Cys Glu Glu Val Ser Met 
805 810 815 

AAG CAT GTA GGC AAC CAG CAA TAC AAC GTC ACA TAC GTC GTC AAG GAG 2496 
Lys His Val Gly Asn Gln Gln Tyr Asn Val Thr Tyr Val Val Lys Glu 
820 825 830 

AGG GGC GAT TAT GTG CTG GCT GTG AAG TGG GGG GAG GAA CAC ATC CCT 2544 
Arg Gly Asp Tyr Val Leu Ala Val Lys Trp Gly Glu Glu His Ile Pro 
835 840 845 

GGC AGC CCT TTT CAT GTC ACA GTG CCT TAA 2574 
Gly Ser Pro Phe His Val Thr Val Pro 
850 855 
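
The correspondence between the nucleotide rows and the amino acid rows of a listing such as SEQ ID NO: 1 can be spot-checked by conceptual translation. The sketch below is illustrative only: it builds the standard genetic code table, translates the first and last rows of the listing, and confirms that a 2574 base coding region corresponds to 857 residues plus a stop codon. Codons containing an ambiguous base (such as GGN) are reported as "X", analogous to Xaa in the listing. The names used are hypothetical.

```python
# Standard genetic code, indexed in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA (spaces allowed) to one-letter amino acids; '*' is stop."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna), 3))


FIRST_ROW = "CCT TTT GAC CTG GTC ATT CCG TTT GCT GTC AGG AAA GGA GAA ATC ACT"
LAST_ROW = "GGC AGC CCT TTT CAT GTC ACA GTG CCT TAA"
CODING_LENGTH = 2574  # base pairs, from the sequence characteristics

print(translate(FIRST_ROW))     # residues 1-16
print(translate(LAST_ROW))      # residues 849-857 followed by the stop codon
print(CODING_LENGTH // 3 - 1)   # residues encoded, excluding the stop codon
```

Running the same function over any other row offers a quick way to flag positions where the nucleotide and amino acid lines of a transcribed listing disagree.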

(2) INFORMATION FOR SEQ ID NO: 2: 

    (i) SEQUENCE CHARACTERISTICS: 
        (A) LENGTH: 857 amino acids 
        (B) TYPE: amino acid 
        (D) TOPOLOGY: not relevant 

    (ii) MOLECULE TYPE: protein 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Pro Phe Asp Leu Val Ile Pro Phe Ala Val Arg Lys Gly Glu Ile Thr 
1 5 10 15 

Gly Glu Val His Met Pro Ser Gly Lys Thr Ala Thr Pro Glu Ile Val 
20 25 30 

Asp Asn Lys Asp Gly Thr Val Thr Val Arg Tyr Ala Pro Thr Glu Val 
35 40 45 

Gly Leu His Glu Met His Ile Lys Tyr Met Gly Ser His Ile Pro Glu 
50 55 60 

Ser Pro Leu Gln Phe Tyr Val Asn Tyr Pro Asn Ser Gly Ser Val Ser 
65 70 75 80 

Ala Tyr Gly Pro Gly Leu Val Tyr Gly Val Ala Asn Lys Thr Ala Thr 
85 90 95 

Phe Thr Ile Val Thr Glu Asp Ala Gly Glu Gly Gly Leu Asp Leu Ala 
100 105 110 

Ile Glu Gly Pro Ser Lys Ala Glu Ile Ser Cys Ile Asp Asn Lys Asp 
115 120 125 

Gly Thr Cys Thr Val Thr Tyr Leu Pro Thr Leu Pro Gly Asp Tyr Ser 
130 135 140 

Ile Leu Val Lys Tyr Asn Asp Lys His Ile Pro Gly Ser Pro Phe Thr 
145 150 155 160 

Ala Lys Ile Thr Asp Asp Ser Arg Arg Cys Ser Gln Val Lys Leu Gly 
165 170 175 

Ser Ala Ala Asp Phe Leu Leu Asp Ile Ser Glu Thr Asp Leu Ser Ser 
180 185 190 

Leu Thr Ala Ser Ile Lys Ala Pro Ser Gly Arg Asp Glu Pro Cys Leu 
195 200 205 

Leu Lys Arg Leu Pro Asn Asn His Ile Gly Ile Ser Phe Ile Pro Arg 
210 215 220 

Glu Val Gly Glu His Leu Val Ser Ile Lys Lys Asn Gly Asn His Val 
225 230 235 240 

Ala Asn Ser Pro Val Ser Ile Met Val Val Gln Ser Glu Ile Gly Asp 
245 250 255 

Ala Arg Arg Ala Lys Val Tyr Gly Arg Gly Leu Ser Glu Gly Arg Thr 
260 265 270 
Phe Glu Met Ser Asp Phe Ile Val Asp Thr Arg Asp Ala Gly Tyr Gly 
275 280 285 

Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Lys Val Asp Ile Gln Thr 
290 295 300 

Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Ser Tyr Phe Pro Thr Val 
305 310 315 320 

Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Ala Asp Glu His Val Pro 
325 330 335 

Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Glu Gly Arg Val Lys Glu 
340 345 350 

Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Val Ala Thr Val Gly Ser 
355 360 365 

Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Ile Asn Ser Ser Asp Met 
370 375 380 

Ser Ala His Val Thr Ser Pro Ser Gly Arg Val Thr Glu Ala Glu Ile 
385 390 395 400 

Val Pro Met Gly Lys Asn Ser His Cys Val Arg Phe Val Pro Gln Glu 
405 410 415 

Met Gly Val His Thr Val Ser Val Lys Tyr Arg Gly Gln His Val Thr 
420 425 430 

Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Leu Gly Glu Gly Gly Ala 
435 440 445 

His Lys Val Arg Ala Gly Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly 
450 455 460 

Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly 
465 470 475 480 

Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp 
485 490 495 

Asp His Lys Asn Gly Ser Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro 
500 505 510 

Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn Asp Glu His Ile Pro Glu 
515 520 525 

Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Ser Asp Asp Ala Arg Arg 
530 535 540 

Leu Thr Val Met Ser Leu Gln Glu Ser Gly Leu Lys Val Asn Gln Pro 
545 550 555 560 

Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala 
565 570 575 

Lys Val His Ser Pro Ser Gly Ala Val Glu Glu Cys His Val Ser Glu 
580 585 590 

Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Ile Pro His Glu Asn Gly 
595 600 605 

Val His Thr Ile Asp Val Lys Phe Asn Gly Ser His Val Val Gly Ser 
610 615 620 




Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala
625 630 635 640

Leu Val Ser Ala Tyr Gly Thr Gly Leu Glu Gly Xaa Thr Thr Gly Ile
645 650 655

Gln Ser Glu Phe Phe Ile Asn Thr Thr Arg Ala Gly Pro Gly Thr Leu
660 665 670

Ser Val Thr Ile Glu Gly Pro Ser Lys Val Lys Met Asp Cys Gln Glu
675 680 685

Thr Pro Glu Gly Tyr Lys Val Met Tyr Thr Pro Met Ala Pro Gly Asn
690 695 700

Tyr Leu Ile Ser Val Lys Tyr Gly Gly Pro Asn His Ile Val Gly Ser
705 710 715 720

Pro Phe Lys Ala Lys Val Thr Gly Gln Arg Leu Val Ser Pro Gly Ser
725 730 735

Ala Asn Glu Thr Ser Ser Ile Leu Val Glu Ser Val Thr Arg Ser Ser
740 745 750

Thr Glu Thr Cys Tyr Ser Ala Ile Pro Lys Ala Ser Ser Asp Ala Ser
755 760 765

Lys Val Thr Ser Lys Gly Ala Gly Leu Ser Lys Ala Phe Val Gly Gln
770 775 780

Lys Ser Ser Phe Leu Val Asp Cys Ser Lys Ala Gly Ser Asn Met Leu
785 790 795 800

Leu Ile Gly Val His Gly Pro Thr Thr Pro Cys Glu Glu Val Ser Met
805 810 815

Lys His Val Gly Asn Gln Gln Tyr Asn Val Thr Tyr Val Val Lys Glu
820 825 830

Arg Gly Asp Tyr Val Leu Ala Val Lys Trp Gly Glu Glu His Ile Pro
835 840 845

Gly Ser Pro Phe His Val Thr Val Pro
850 855



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CGGATCCTCG GATACCGGCT CTCGGTGAAG 30

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CGGCTCCTCA GAGAGTGGGA CTGTCTGCCT 30



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 545 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..534

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

AAG GTG AAG ATG GAT TGC CAG GAG TGC CCT GAG GGC TAC CGC GTC ACC 48
Lys Val Lys Met Asp Cys Gln Glu Cys Pro Glu Gly Tyr Arg Val Thr
1 5 10 15

TAT ACC CCC ATG GCA CCT GGC AGC TAC CTC ATC TCC ATC AAG TAC GGC 96
Tyr Thr Pro Met Ala Pro Gly Ser Tyr Leu Ile Ser Ile Lys Tyr Gly
20 25 30

GGC CCC TAC CAC ATT GGG GGC AGC CCC TTC AAG GCC AAA GTC ACA GGC 144
Gly Pro Tyr His Ile Gly Gly Ser Pro Phe Lys Ala Lys Val Thr Gly
35 40 45

CCC CGT CTC GTC AGC AAC CAC AGC CTC CAC GAG ACA TCA TCA GTG TTT 192
Pro Arg Leu Val Ser Asn His Ser Leu His Glu Thr Ser Ser Val Phe
50 55 60

GTA GAC TCT CTG ACC AAG GCC ACC TGT GCC CCC CAG CAT GGG GCC CCG 240
Val Asp Ser Leu Thr Lys Ala Thr Cys Ala Pro Gln His Gly Ala Pro
65 70 75 80

GGT CCT GGG CCT GCT GAC GCC AGC AAG GTG GTG GCC AAG GGC CTG GGG 288
Gly Pro Gly Pro Ala Asp Ala Ser Lys Val Val Ala Lys Gly Leu Gly
85 90 95

CTG AGC AAG GCC TAC GTA GGC CAG AAG AGC AGC TTC ACA GTA GAC TGC 336
Leu Ser Lys Ala Tyr Val Gly Gln Lys Ser Ser Phe Thr Val Asp Cys
100 105 110

AGC AAA GCA GGC AAC AAC ATG CTG CTG GTG GGG GTT CAT GGC CCA AGG 384
Ser Lys Ala Gly Asn Asn Met Leu Leu Val Gly Val His Gly Pro Arg
115 120 125

ACC CCC TGC GAG GAG ATC CTG GTG AAG CAC GTG GGC AGC CGG CTC TAC 432
Thr Pro Cys Glu Glu Ile Leu Val Lys His Val Gly Ser Arg Leu Tyr
130 135 140

AGC GTG TCC TAC CTG CTC AAG GAC AAG GGG GAG TAC ACA CTG GTG GTC 480
Ser Val Ser Tyr Leu Leu Lys Asp Lys Gly Glu Tyr Thr Leu Val Val
145 150 155 160

AAA TGG GGG GAC GAG CAC ATC CCA GGC AGN CCC TAC CGN GTT GTG GTG 528
Lys Trp Gly Asp Glu His Ile Pro Gly Xaa Pro Tyr Xaa Val Val Val
165 170 175

CCC TGAGTCTTGG GGCC 545
Pro
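
By way of illustration only, the correspondence between a codon row of SEQ ID NO: 5 and the amino acid row printed beneath it can be checked with a short Python sketch. The names `CODON_TO_AA` and `translate_row` are hypothetical and are introduced here solely for the example; the table covers only the codons that occur in the first row, and the standard genetic code is assumed.

```python
# Hypothetical helper for illustration; not part of the disclosed methods.
# Partial codon table (standard genetic code) covering only the codons
# found in the first row of SEQ ID NO: 5.
CODON_TO_AA = {
    "AAG": "Lys", "GTG": "Val", "ATG": "Met", "GAT": "Asp",
    "TGC": "Cys", "CAG": "Gln", "GAG": "Glu", "CCT": "Pro",
    "GGC": "Gly", "TAC": "Tyr", "CGC": "Arg", "GTC": "Val",
    "ACC": "Thr",
}


def translate_row(codon_row):
    """Translate a space-separated row of codons into three-letter residues."""
    return [CODON_TO_AA[codon] for codon in codon_row.split()]


# First row of SEQ ID NO: 5 as printed in the listing.
first_row = "AAG GTG AAG ATG GAT TGC CAG GAG TGC CCT GAG GGC TAC CGC GTC ACC"
listed = "Lys Val Lys Met Asp Cys Gln Glu Cys Pro Glu Gly Tyr Arg Val Thr"

translated = translate_row(first_row)
row_matches = translated == listed.split()

# Each row holds 16 codons, so the running nucleotide count printed at the
# end of the first row should be 3 * 16.
end_position = 3 * len(translated)
```

The same comparison may be repeated row by row; a row in which the translated residues differ from the printed residues would indicate a transcription error in one of the two lines.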

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 177 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Lys Val Lys Met Asp Cys Gln Glu Cys Pro Glu Gly Tyr Arg Val Thr
1 5 10 15

Tyr Thr Pro Met Ala Pro Gly Ser Tyr Leu Ile Ser Ile Lys Tyr Gly
20 25 30

Gly Pro Tyr His Ile Gly Gly Ser Pro Phe Lys Ala Lys Val Thr Gly
35 40 45

Pro Arg Leu Val Ser Asn His Ser Leu His Glu Thr Ser Ser Val Phe
50 55 60

Val Asp Ser Leu Thr Lys Ala Thr Cys Ala Pro Gln His Gly Ala Pro
65 70 75 80

Gly Pro Gly Pro Ala Asp Ala Ser Lys Val Val Ala Lys Gly Leu Gly
85 90 95

Leu Ser Lys Ala Tyr Val Gly Gln Lys Ser Ser Phe Thr Val Asp Cys
100 105 110

Ser Lys Ala Gly Asn Asn Met Leu Leu Val Gly Val His Gly Pro Arg
115 120 125

Thr Pro Cys Glu Glu Ile Leu Val Lys His Val Gly Ser Arg Leu Tyr
130 135 140

Ser Val Ser Tyr Leu Leu Lys Asp Lys Gly Glu Tyr Thr Leu Val Val
145 150 155 160

Lys Trp Gly Asp Glu His Ile Pro Gly Xaa Pro Tyr Xaa Val Val Val
165 170 175

Pro

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8367 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 172..8115

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGATCCGGGC GCCACCCCGC GGTCATCGGT CACCGGTCGC TCTCAGGAAC AGCAGCGCAA 60 

CCTCTGCTCC CTGCCTCGCC TCCCGCGCGC CTAGGTGCCT GCGACTTTAA TTAAAGGGCC 120 

GTCCCCTCGC CGAGGCTGCA GCACCGCCCC CCCGGCTTCT CGCGCCTCAA A ATG AGT 177 

Met Ser 
1 

AGC TCC CAC TCT CGG GCG GGC GAG AGC GCA GCA GGC GCG GCT CCG GGC 225
Ser Ser His Ser Arg Ala Gly Gln Ser Ala Ala Gly Ala Ala Pro Gly
5 10 15

GGC GGC GTC GAC ACG CGG GAC GCC GAG ATG CCG GCC ACC GAG AAG GAC 273
Gly Gly Val Asp Thr Arg Asp Ala Glu Met Pro Ala Thr Glu Lys Asp
20 25 30

CTG GCG GAG GAC GCG CCG TGG AAG AAG ATC CAG CAG AAC ACT TTC ACG 321
Leu Ala Glu Asp Ala Pro Trp Lys Lys Ile Gln Gln Asn Thr Phe Thr
35 40 45 50

CGC TGG TGC AAC GAG CAC CTG AAG TGC GTG AGC AAG CGC ATC GCC AAC 369
Arg Trp Cys Asn Glu His Leu Lys Cys Val Ser Lys Arg Ile Ala Asn
55 60 65

CTG CAG ACG GAC CTG AGC GAC GGG CTG CGG CTT ATC GCG CTG TTG GAG 417
Leu Gln Thr Asp Leu Ser Asp Gly Leu Arg Leu Ile Ala Leu Leu Glu
70 75 80

GTG CTC AGC CAG AAG AAG ATG CAC CGC AAG CAC AAC CAG CGG CCC ACT 465
Val Leu Ser Gln Lys Lys Met His Arg Lys His Asn Gln Arg Pro Thr
85 90 95

TTC CGC CAA ATG CAG CTT GAG AAC GTG TCG GTG GCG CTC GAG TTC CTG 513
Phe Arg Gln Met Gln Leu Glu Asn Val Ser Val Ala Leu Glu Phe Leu
100 105 110

GAC CGC GAG AGC ATC AAA CTG GTG TCC ATC GAC AGC AAG GCC ATC GTG 561
Asp Arg Glu Ser Ile Lys Leu Val Ser Ile Asp Ser Lys Ala Ile Val
115 120 125 130

GAC GGG AAC CTG AAG CTG ATC CTG GGC CTC ATC TGG ACC CTG ATC CTG 609
Asp Gly Asn Leu Lys Leu Ile Leu Gly Leu Ile Trp Thr Leu Ile Leu
135 140 145

CAC TAG TCC ATC TCC ATG CCC ATG TGG GAC GAG GAG GAG GAT GAG GAG 657
His Tyr Ser Ile Ser Met Pro Met Trp Asp Glu Glu Glu Asp Glu Glu
150 155 160

GCC AAG AAG CAG ACC CCC AAG CAG AGG CTC CTG GGC TGG ATC CAG AAC 705
Ala Lys Lys Gln Thr Pro Lys Gln Arg Leu Leu Gly Trp Ile Gln Asn
165 170 175

AAG CTG CCG CAG CTG CCC ATC ACC AAC TTC AGC CGG GAC TGG CAG AGC 753
Lys Leu Pro Gln Leu Pro Ile Thr Asn Phe Ser Arg Asp Trp Gln Ser
180 185 190

GGC CGG GCC CTG GGC GCC CTG GTG GAC AGC TGT GCC CCG GGC CTG TGT 801
Gly Arg Ala Leu Gly Ala Leu Val Asp Ser Cys Ala Pro Gly Leu Cys
195 200 205 210

CCT GAC TGG GAC TCT TGG GAC GCC AGC AAG CCC GTT ACC AAT GCG CGA 849
Pro Asp Trp Asp Ser Trp Asp Ala Ser Lys Pro Val Thr Asn Ala Arg
215 220 225

GAG GCC ATG CAG CAG GCG GAT GAC TGG CTG GGC ATC CCC CAG GTG ATC 897
Glu Ala Met Gln Gln Ala Asp Asp Trp Leu Gly Ile Pro Gln Val Ile
230 235 240

ACC CCC GAG GAG ATT GTG GAC CCC AAC GTG GAC GAG CAC TCT GTC ATG 945
Thr Pro Glu Glu Ile Val Asp Pro Asn Val Asp Glu His Ser Val Met
245 250 255

ACC TAC CTG TCC CAG TTC CCC AAG GCC AAG CTG AAG CCA GGG GCT CCC 993
Thr Tyr Leu Ser Gln Phe Pro Lys Ala Lys Leu Lys Pro Gly Ala Pro
260 265 270

TTG CGC CCC AAA CTG AAC CCG AAG AAA GCC CGT GCC TAC GGG CCA GGC 1041
Leu Arg Pro Lys Leu Asn Pro Lys Lys Ala Arg Ala Tyr Gly Pro Gly
275 280 285 290

ATC GAG CCC ACA GGC AAC ATG GTG AAG AAG CGG GCA GAG TTC ACT GTG 1089
Ile Glu Pro Thr Gly Asn Met Val Lys Lys Arg Ala Glu Phe Thr Val
295 300 305

GAG ACC AGA AGT GCT GGC CAG GGA GAG GTG CTG GTG TAC GTG GAG GAC 1137
Glu Thr Arg Ser Ala Gly Gln Gly Glu Val Leu Val Tyr Val Glu Asp
310 315 320

CCG GCC GGA CAC CAG GAG GAG GCA AAA GTG ACC GCC AAT AAC GAC AAG 1185
Pro Ala Gly His Gln Glu Glu Ala Lys Val Thr Ala Asn Asn Asp Lys
325 330 335

AAC CGC ACC TTC TCC GTC TGG TAC GTC CCC GAG GTG ACG GGG ACT CAT 1233
Asn Arg Thr Phe Ser Val Trp Tyr Val Pro Glu Val Thr Gly Thr His
340 345 350

AAG GTT ACT GTG CTC TTT GCT GGC CAG CAC ATC GCC AAG AGC CCC TTC 1281
Lys Val Thr Val Leu Phe Ala Gly Gln His Ile Ala Lys Ser Pro Phe
355 360 365 370

GAG GTG TAC GTG GAT AAG TCA CAG GGT GAC GCC AGC AAA GTG ACA GCC 1329
Glu Val Tyr Val Asp Lys Ser Gln Gly Asp Ala Ser Lys Val Thr Ala
375 380 385

CAA GGT CCC GGC CTG GAG CCC AGT GGC AAC ATC GCC AAC AAG ACC ACC 1377
Gln Gly Pro Gly Leu Glu Pro Ser Gly Asn Ile Ala Asn Lys Thr Thr
390 395 400

TAC TTT GAG ATC TTT ACG GCA GGA GCT GGC ACG GGC GAG GTC GAG GTT 1425
Tyr Phe Glu Ile Phe Thr Ala Gly Ala Gly Thr Gly Glu Val Glu Val
405 410 415

GTG ATC CAG GAC CCC ATG GGA CAG AAG GGC ACG GTA GAG CCT CAG CTG 1473
Val Ile Gln Asp Pro Met Gly Gln Lys Gly Thr Val Glu Pro Gln Leu
420 425 430

GAG GCC CGG GGC GAC AGC ACA TAC CGC TGC AGC TAC CAG CCC ACC ATG 1521
Glu Ala Arg Gly Asp Ser Thr Tyr Arg Cys Ser Tyr Gln Pro Thr Met
435 440 445 450

GAG GGC GTC CAC ACC GTG CAC GTC ACG TTT GCC GGC GTG CCC ATC CCT 1569
Glu Gly Val His Thr Val His Val Thr Phe Ala Gly Val Pro Ile Pro
455 460 465

CGC AGC CCC TAC ACT GTC ACT GTT GGC CAA GCC TGT AAC CCG AGT GCC 1617
Arg Ser Pro Tyr Thr Val Thr Val Gly Gln Ala Cys Asn Pro Ser Ala
470 475 480

TGC CGG GCG GTT GGC CGG GGC CTC CAG CCC AAG GGT GTG CGG GTG AAG 1665
Cys Arg Ala Val Gly Arg Gly Leu Gln Pro Lys Gly Val Arg Val Lys
485 490 495

GAG ACA GCT GAC TTC AAG GTG TAC ACA AAG GGC GCT GGC AGT GGG GAG 1713
Glu Thr Ala Asp Phe Lys Val Tyr Thr Lys Gly Ala Gly Ser Gly Glu
500 505 510

CTG AAG GTC ACC GTG AAG GGC CCC AAG GGA GAG GAG CGC GTG AAG CAG 1761
Leu Lys Val Thr Val Lys Gly Pro Lys Gly Glu Glu Arg Val Lys Gln
515 520 525 530

AAG GAC CTG GGG GAT GGC GTG TAT GGC TTC GAG TAT TAC CCC ATG GTC 1809
Lys Asp Leu Gly Asp Gly Val Tyr Gly Phe Glu Tyr Tyr Pro Met Val
535 540 545

CCT GGA ACC TAT ATC GTC ACC ATC ACG TGG GGT GGT CAG AAC ATC GGG 1857
Pro Gly Thr Tyr Ile Val Thr Ile Thr Trp Gly Gly Gln Asn Ile Gly
550 555 560

CGC AGT CCC TTC GAA GTG AAG GTG GGC ACC GAG TGT GGC AAT CAG AAG 1905
Arg Ser Pro Phe Glu Val Lys Val Gly Thr Glu Cys Gly Asn Gln Lys
565 570 575

GTA CGG GCC TGG GGC CCT GGG CTG GAG GGC GGC GTC GTT GGC AAG TCA 1953
Val Arg Ala Trp Gly Pro Gly Leu Glu Gly Gly Val Val Gly Lys Ser
580 585 590

GCA GAC TTT GTG GTG GAG GCT ATC GGG GAC GAC GTG GGC ACG CTG GGC 2001
Ala Asp Phe Val Val Glu Ala Ile Gly Asp Asp Val Gly Thr Leu Gly
595 600 605 610

TTC TCG GTG GAA GGG CCA TCG CAG GCT AAG ATC GAA TGT GAC GAC AAG 2049
Phe Ser Val Glu Gly Pro Ser Gln Ala Lys Ile Glu Cys Asp Asp Lys
615 620 625

GGC GAC GGC TCC TGT GAT GTG CGC TAC TGG CCG CAG GAG GCT GGC GAG 2097
Gly Asp Gly Ser Cys Asp Val Arg Tyr Trp Pro Gln Glu Ala Gly Glu
630 635 640

TAT GCC GTT CAC GTG CTG TGC AAC AGC GAA GAC ATC CGC CTC AGC CCC 2145
Tyr Ala Val His Val Leu Cys Asn Ser Glu Asp Ile Arg Leu Ser Pro
645 650 655

TTC ATG GCT GAC ATC CGT GAC GCG CCC CAG GAC TTC CAC CCA GAC AGG 2193
Phe Met Ala Asp Ile Arg Asp Ala Pro Gln Asp Phe His Pro Asp Arg
660 665 670

GTG AAG GCA CGT GGG CCT GGA TTG GAG AAG ACA GGT GTG GCC GTC AAC 2241
Val Lys Ala Arg Gly Pro Gly Leu Glu Lys Thr Gly Val Ala Val Asn
675 680 685 690

AAG CCA GCA GAG TTC ACA GTG GAT GCC AAG CAC GGT GGC AAG GCC CCA 2289
Lys Pro Ala Glu Phe Thr Val Asp Ala Lys His Gly Gly Lys Ala Pro
695 700 705

CTT CGG GTC CAA GTC CAG GAC AAT GAA GGC TGC CCT GTG GAG GCG TTG 2337
Leu Arg Val Gln Val Gln Asp Asn Glu Gly Cys Pro Val Glu Ala Leu
710 715 720

GTC AAG GAC AAC GGC AAT GGC ACT TAC AGC TGC TCC TAC GTG CCC AGG 2385
Val Lys Asp Asn Gly Asn Gly Thr Tyr Ser Cys Ser Tyr Val Pro Arg
725 730 735

AAG CCG GTG AAG CAC ACA GCC ATG GTG TCC TGG GGA GGC GTC AGC ATC 2433
Lys Pro Val Lys His Thr Ala Met Val Ser Trp Gly Gly Val Ser Ile
740 745 750

CCC AAC AGC CCC TTC AGG GTG AAT GTG GGA GCT GGC AGC CAC CCC AAC 2481
Pro Asn Ser Pro Phe Arg Val Asn Val Gly Ala Gly Ser His Pro Asn
755 760 765 770

AAG GTC AAA GTA TAC GGC CCC GGA GTA GCC AAG ACA GGG CTC AAG GCC 2529
Lys Val Lys Val Tyr Gly Pro Gly Val Ala Lys Thr Gly Leu Lys Ala
775 780 785

CAC GAG CCC ACC TAC TTC ACT GTG GAC TGC GCC GAG GCT GGC CAG GGG 2577
His Glu Pro Thr Tyr Phe Thr Val Asp Cys Ala Glu Ala Gly Gln Gly
790 795 800

GAC GTC AGC ATC GGC ATC AAG TGT GCC CCT GGA GTG GTA GGC CCC GCC 2625
Asp Val Ser Ile Gly Ile Lys Cys Ala Pro Gly Val Val Gly Pro Ala
805 810 815

GAA GCT GAC ATC GAC TTC GAC ATC ATC CGC AAT GAC AAT GAC ACC TTC 2673
Glu Ala Asp Ile Asp Phe Asp Ile Ile Arg Asn Asp Asn Asp Thr Phe
820 825 830

ACG GTC AAG TAC ACG CCC CGG GGG GCT GGC AGC TAC ACC ATT ATG GTC 2721
Thr Val Lys Tyr Thr Pro Arg Gly Ala Gly Ser Tyr Thr Ile Met Val
835 840 845 850

CTC TTT GCT GAC CAG GCC ACG CCC ACC AGC CCC ATC CGA GTC AAG GTG 2769
Leu Phe Ala Asp Gln Ala Thr Pro Thr Ser Pro Ile Arg Val Lys Val
855 860 865

GAG CCC TCT CAT GAC GCC AGT AAG GTG AAG GCC GAG GGC CCT GGC CTC 2817
Glu Pro Ser His Asp Ala Ser Lys Val Lys Ala Glu Gly Pro Gly Leu
870 875 880

AGT CGC ACT GGT GTC GAG CTT GGC AAG CCC ACC CAC TTC ACA GTA AAT 2865
Ser Arg Thr Gly Val Glu Leu Gly Lys Pro Thr His Phe Thr Val Asn
885 890 895

GCC AAA GCT GCT GGC AAA GGC AAG CTG GAC GTC CAG TTC TCA GGA CTC 2913
Ala Lys Ala Ala Gly Lys Gly Lys Leu Asp Val Gln Phe Ser Gly Leu
900 905 910

ACC AAG GGG GAT GCA GTG CGA GAT GTG GAC ATC ATC GAC CAC CAT GAC 2961
Thr Lys Gly Asp Ala Val Arg Asp Val Asp Ile Ile Asp His His Asp
915 920 925 930

AAC ACC TAC ACA GTC AAG TAC ACG CCT GTC CAG CAG GGT CCA GTA GGC 3009
Asn Thr Tyr Thr Val Lys Tyr Thr Pro Val Gln Gln Gly Pro Val Gly
935 940 945

GTC AAT GTC ACT TAT GGA GGG GAT CCC ATC CCT AAG AGC CCT TTC TCA 3057
Val Asn Val Thr Tyr Gly Gly Asp Pro Ile Pro Lys Ser Pro Phe Ser
950 955 960

GTG GCA GTA TCT CCA AGC CTG GAC CTC AGC AAG ATC AAG GTG TCT GGC 3105
Val Ala Val Ser Pro Ser Leu Asp Leu Ser Lys Ile Lys Val Ser Gly
965 970 975

CTG GGA GAG AAG GTG GAC GTT GGC AAA GAC CAG GAG TTC ACA GTC AAA 3153
Leu Gly Glu Lys Val Asp Val Gly Lys Asp Gln Glu Phe Thr Val Lys
980 985 990

TCA AAG GGT GCT GGT GGT CAA GGC AAA GTG GCA TCC AAG ATT GTG GGC 3201
Ser Lys Gly Ala Gly Gly Gln Gly Lys Val Ala Ser Lys Ile Val Gly
995 1000 1005 1010

CCC TCG GGT GCA GCG GTG CCC TGC AAG GTG GAG CCA GGC CTG GGG GCT 3249
Pro Ser Gly Ala Ala Val Pro Cys Lys Val Glu Pro Gly Leu Gly Ala
1015 1020 1025

GAC AAC AGT GTG GTG CGC TTC CTG CCC CGT GAG GAA GGG CCC TAT GAG 3297
Asp Asn Ser Val Val Arg Phe Leu Pro Arg Glu Glu Gly Pro Tyr Glu
1030 1035 1040

GTG GAG GTG ACC TAT GAC GGC GTG CCC GTG CCT GGC AGC CCC TTT CCT 3345
Val Glu Val Thr Tyr Asp Gly Val Pro Val Pro Gly Ser Pro Phe Pro
1045 1050 1055

CTG GAA GCT GTG GCC CCC ACC AAG CCT AGC AAG GTG AAG GCG TTT GGG 3393
Leu Glu Ala Val Ala Pro Thr Lys Pro Ser Lys Val Lys Ala Phe Gly
1060 1065 1070

CCG GGG CTG CAG GGA GGC AGT GCG GGC TCC CCC GCC CGC TTC ACC ATC 3441
Pro Gly Leu Gln Gly Gly Ser Ala Gly Ser Pro Ala Arg Phe Thr Ile
1075 1080 1085 1090

GAC ACC AAG GGC GCC GGC ACA GGT GGC CTG GGC CTG ACG GTG GAG GGC 3489
Asp Thr Lys Gly Ala Gly Thr Gly Gly Leu Gly Leu Thr Val Glu Gly
1095 1100 1105

CCC TGT GAG GCG CAG CTC GAG TGC TTG GAC AAT GGG GAT GGC ACA TGT 3537
Pro Cys Glu Ala Gln Leu Glu Cys Leu Asp Asn Gly Asp Gly Thr Cys
1110 1115 1120

TCC GTG TCC TAC GTG CCC ACC GAG CCC GGG GAC TAC AAC ATC AAC ATC 3585
Ser Val Ser Tyr Val Pro Thr Glu Pro Gly Asp Tyr Asn Ile Asn Ile
1125 1130 1135

CTC TTC GCT GAC ACC CAC ATC CCT GGC TCC CCA TTC AAG GCC CAC GTG 3633
Leu Phe Ala Asp Thr His Ile Pro Gly Ser Pro Phe Lys Ala His Val
1140 1145 1150

GTT CCC TGC TTT GAC GCA TCC AAA GTC AAG TGC TCA GGC CCC GGG CTG 3681
Val Pro Cys Phe Asp Ala Ser Lys Val Lys Cys Ser Gly Pro Gly Leu
1155 1160 1165 1170

GAG CGG GCC ACC GCT GGG GAG GTG GGC CAA TTC CAA GTG GAC TGC TCG 3729
Glu Arg Ala Thr Ala Gly Glu Val Gly Gln Phe Gln Val Asp Cys Ser
1175 1180 1185

AGC GCG GGC AGC GCG GAG CTG ACC ATT GAG ATC TGC TCG GAG GCG GGG 3777
Ser Ala Gly Ser Ala Glu Leu Thr Ile Glu Ile Cys Ser Glu Ala Gly
1190 1195 1200

CTT CCG GCC GAG GTG TAC ATC CAG GAC CAC GGT GAT GGC ACG CAC ACC 3825
Leu Pro Ala Glu Val Tyr Ile Gln Asp His Gly Asp Gly Thr His Thr
1205 1210 1215

ATT ACC TAC ATT CCC CTC TGC CCC GGG GCC TAC ACC GTC ACC ATC AAG 3873
Ile Thr Tyr Ile Pro Leu Cys Pro Gly Ala Tyr Thr Val Thr Ile Lys
1220 1225 1230

TAC GGC GGC CAG CCC GTG CCC AAC TTC CCC AGC AAG CTG CAG GTG GAA 3921
Tyr Gly Gly Gln Pro Val Pro Asn Phe Pro Ser Lys Leu Gln Val Glu
1235 1240 1245 1250

CCT GCG GTG GAC ACT TCC GGT GTC CAG TGC TAT GGG CCT GGT ATT GAG 3969
Pro Ala Val Asp Thr Ser Gly Val Gln Cys Tyr Gly Pro Gly Ile Glu
1255 1260 1265

GGC CAG GGT GTC TTC CGT GAG GCC ACC ACT GAG TTC AGT GTG GAC GCC 4017
Gly Gln Gly Val Phe Arg Glu Ala Thr Thr Glu Phe Ser Val Asp Ala
1270 1275 1280

CGG GCT CTG ACA CAG ACC GGA GGG CCG CAC GTC AAG GCC CGT GTG GCC 4065
Arg Ala Leu Thr Gln Thr Gly Gly Pro His Val Lys Ala Arg Val Ala
1285 1290 1295

AAC CCC TCA GGC AAC CTG ACG GAG ACC TAC GTT CAG GAC CGT GGC GAT 4113
Asn Pro Ser Gly Asn Leu Thr Glu Thr Tyr Val Gln Asp Arg Gly Asp
1300 1305 1310

GGC ATG TAC AAA GTG GAG TAC ACG CCT TAC GAG GAG GGA CTG CAC TCC 4161
Gly Met Tyr Lys Val Glu Tyr Thr Pro Tyr Glu Glu Gly Leu His Ser
1315 1320 1325 1330

GTG GAC GTG ACC TAT GAC GGC AGT CCC GTG CCC AGC AGC CCC TTC CAG 4209
Val Asp Val Thr Tyr Asp Gly Ser Pro Val Pro Ser Ser Pro Phe Gln
1335 1340 1345

GTG CCC GTG ACC GAG GGC TGC GAC CCC TCC CGG GTG CGT GTC CAC GGG 4257
Val Pro Val Thr Glu Gly Cys Asp Pro Ser Arg Val Arg Val His Gly
1350 1355 1360

CCA GGC ATC CAA AGT GGC ACC ACC AAC AAG CCC AAC AAG TTC ACT GTG 4305
Pro Gly Ile Gln Ser Gly Thr Thr Asn Lys Pro Asn Lys Phe Thr Val
1365 1370 1375

GAG ACC AGG GGA GCT GGC ACG GGC GGC CTG GGC CTG GCT GTA GAG GGC 4353
Glu Thr Arg Gly Ala Gly Thr Gly Gly Leu Gly Leu Ala Val Glu Gly
1380 1385 1390

CCC TCC GAG GCC AAG ATG TCC TGC ATG GAT AAC AAG GAC GGC AGC TGC 4401
Pro Ser Glu Ala Lys Met Ser Cys Met Asp Asn Lys Asp Gly Ser Cys
1395 1400 1405 1410

TCG GTC GAG TAC ATC CCT TAT GAG GCT GGC ACC TAC AGC CTC AAC GTC 4449
Ser Val Glu Tyr Ile Pro Tyr Glu Ala Gly Thr Tyr Ser Leu Asn Val
1415 1420 1425

ACC TAT GGT GGC CAT CAA GTG CCA GGC AGT CCT TTC AAG GTC CCT GTG 4497
Thr Tyr Gly Gly His Gln Val Pro Gly Ser Pro Phe Lys Val Pro Val
1430 1435 1440

CAT GAT GTG ACA GAT GCG TCC AAG GTC AAG TGC TCT GGG CCC GGC CTG 4545
His Asp Val Thr Asp Ala Ser Lys Val Lys Cys Ser Gly Pro Gly Leu
1445 1450 1455

AGC CCA GGC ATG GTT CGT GCC AAC CTC CCT CAG TCC TTC CAG GTG GAC 4593
Ser Pro Gly Met Val Arg Ala Asn Leu Pro Gln Ser Phe Gln Val Asp
1460 1465 1470

ACA AGC AAG GCT GGT GTG GCC CCA TTG CAG GTC AAA GTG CAA GGG CCC 4641
Thr Ser Lys Ala Gly Val Ala Pro Leu Gln Val Lys Val Gln Gly Pro
1475 1480 1485 1490

AAA GGC CTG GTG GAG CCA GTG GAC GTG GTA GAC AAC GCT GAT GGC ACC 4689
Lys Gly Leu Val Glu Pro Val Asp Val Val Asp Asn Ala Asp Gly Thr
1495 1500 1505

CAG ACC GTC AAT TAT GTG CCC AGC CGA GAA GGG CCC TAC AGC ATC TCA 4737
Gln Thr Val Asn Tyr Val Pro Ser Arg Glu Gly Pro Tyr Ser Ile Ser
1510 1515 1520

GTA CTG TAT GGA GAT GAA GAG GTA CCC CGG AGC CCC TTC AAG GTC AAG 4785
Val Leu Tyr Gly Asp Glu Glu Val Pro Arg Ser Pro Phe Lys Val Lys
1525 1530 1535

GTG CTG CCT ACT CAT GAT GCC AGC AAG GTG AAG GCC AGT GGC CCC GGG 4833
Val Leu Pro Thr His Asp Ala Ser Lys Val Lys Ala Ser Gly Pro Gly
1540 1545 1550

CTC AAC ACC ACT GGC GTG CCT GCC AGC CTG CCC GTG GAG TTC ACC ATC 4881
Leu Asn Thr Thr Gly Val Pro Ala Ser Leu Pro Val Glu Phe Thr Ile
1555 1560 1565 1570

GAT GCA AAG GAC GCC GGG GAG GGC CTG CTG GCT GTC CAG ATC ACG GAT 4929
Asp Ala Lys Asp Ala Gly Glu Gly Leu Leu Ala Val Gln Ile Thr Asp
1575 1580 1585

CCC GAA GGC AAG CCG AAG AAG ACA CAC ATC CAA GAC AAC CAT GAC GGC 4977
Pro Glu Gly Lys Pro Lys Lys Thr His Ile Gln Asp Asn His Asp Gly
1590 1595 1600

ACG TAT ACA GTG GCC TAC GTG CCA GAC GTG ACA GGT CGC TAC ACC ATC 5025
Thr Tyr Thr Val Ala Tyr Val Pro Asp Val Thr Gly Arg Tyr Thr Ile
1605 1610 1615

CTC ATC AAG TAC GGT GGT GAC GAG ATC CCC TTC TCC CCG TAC CGC GTG 5073
Leu Ile Lys Tyr Gly Gly Asp Glu Ile Pro Phe Ser Pro Tyr Arg Val
1620 1625 1630

CGT GCC GTG CCC ACC GGG GAC GCC AGC AAG TGC ACT GTC ACA GTG TCA 5121
Arg Ala Val Pro Thr Gly Asp Ala Ser Lys Cys Thr Val Thr Val Ser
1635 1640 1645 1650

ATC GGA GGT CAC GGG CTA GGT GCT GGC ATC GGC CCC ACC ATT CAG ATT 5169
Ile Gly Gly His Gly Leu Gly Ala Gly Ile Gly Pro Thr Ile Gln Ile
1655 1660 1665

GGG GAG GAG ACG GTG ATC ACT GTG GAC ACT AAG GCG GCA GGC AAA GGC 5217
Gly Glu Glu Thr Val Ile Thr Val Asp Thr Lys Ala Ala Gly Lys Gly
1670 1675 1680

AAA GTG ACG TGC ACC GTG TGC ACG CCT GAT GGC TCA GAG GTG GAT GTG 5265
Lys Val Thr Cys Thr Val Cys Thr Pro Asp Gly Ser Glu Val Asp Val
1685 1690 1695

GAC GTG GTG GAG AAT GAG GAC GGC ACT TTC GAC ATC TTC TAC ACG GCC 5313
Asp Val Val Glu Asn Glu Asp Gly Thr Phe Asp Ile Phe Tyr Thr Ala
1700 1705 1710

CCC CAG CCG GGC AAA TAC GTC ATC TGT GTG CGC TTT GGT GGC GAG CAC 5361
Pro Gln Pro Gly Lys Tyr Val Ile Cys Val Arg Phe Gly Gly Glu His
1715 1720 1725 1730

GTG CCC AAC AGC CCC TTC CAA GTG ACG GCT CTG GCT GGG GAC CAG CCC 5409
Val Pro Asn Ser Pro Phe Gln Val Thr Ala Leu Ala Gly Asp Gln Pro
1735 1740 1745

TCG GTG CAG CCC CCT CTA CGG TCT CAG CAG CTG GCC CCA CAG TAC ACC 5457
Ser Val Gln Pro Pro Leu Arg Ser Gln Gln Leu Ala Pro Gln Tyr Thr
1750 1755 1760

TAC GCC CAG GGC GGC CAG CAG ACT TGG GCC CCG GAG AGG CCC CTG GTG 5505
Tyr Ala Gln Gly Gly Gln Gln Thr Trp Ala Pro Glu Arg Pro Leu Val
1765 1770 1775

GGT GTC AAT GGG CTG GAT GTG ACC AGC CTG AGG CCC TTT GAC CTT GTC 5553
Gly Val Asn Gly Leu Asp Val Thr Ser Leu Arg Pro Phe Asp Leu Val
1780 1785 1790

ATC CCC TTC ACC ATC AAG AAG GGC GAG ATC ACA GGG GAG GTT CGG ATG 5601
Ile Pro Phe Thr Ile Lys Lys Gly Glu Ile Thr Gly Glu Val Arg Met
1795 1800 1805 1810

CCC TCA GGC AAG GTG GCG CAG CCC ACC ATC ACT GAC AAC AAA GAC GGC 5649
Pro Ser Gly Lys Val Ala Gln Pro Thr Ile Thr Asp Asn Lys Asp Gly
1815 1820 1825

ACC GTG ACC GTG CGG TAT GCA CCC AGC GAG GCT GGC CTG CAC GAG ATG 5697
Thr Val Thr Val Arg Tyr Ala Pro Ser Glu Ala Gly Leu His Glu Met
1830 1835 1840

GAC ATC CGC TAT GAC AAC ATG CAC ATC CCA GGA AGC CCC TTG CAG TTC 5745
Asp Ile Arg Tyr Asp Asn Met His Ile Pro Gly Ser Pro Leu Gln Phe
1845 1850 1855

TAT GTG GAT TAC GTC AAC TGT GGC CAT GTC ACT GCC TAT GGG CCT GGC 5793
Tyr Val Asp Tyr Val Asn Cys Gly His Val Thr Ala Tyr Gly Pro Gly
1860 1865 1870

CTC ACC CAT GGA GTA GTG AAC AAG CCT GCC ACC TTC ACC GTC AAC ACC 5841
Leu Thr His Gly Val Val Asn Lys Pro Ala Thr Phe Thr Val Asn Thr
1875 1880 1885 1890

AAG GAT GCA GGA GAG GGG GGC CTG TCT CTG GCC ATT GAG GGC CCG TCC 5889
Lys Asp Ala Gly Glu Gly Gly Leu Ser Leu Ala Ile Glu Gly Pro Ser
1895 1900 1905

AAA GCA GAA ATC AGC TGC ACT GAC AAC CAG GAT GGG ACA TGC AGC GTG 5937
Lys Ala Glu Ile Ser Cys Thr Asp Asn Gln Asp Gly Thr Cys Ser Val
1910 1915 1920

TCC TAC CTG CCT GTG CTG CCG GGG GAC TAC AGC ATT CTA GTC AAG TAC 5985
Ser Tyr Leu Pro Val Leu Pro Gly Asp Tyr Ser Ile Leu Val Lys Tyr
1925 1930 1935

AAT GAA CAG CAC GTC CCA GGC AGC CCC TTC ACT GCT CGG GTC ACA GGT 6033
Asn Glu Gln His Val Pro Gly Ser Pro Phe Thr Ala Arg Val Thr Gly
1940 1945 1950

GAC GAC TCC ATG CGT ATG TCC CAC CTA AAG GTC GGC TCT GCT GCC GAC 6081
Asp Asp Ser Met Arg Met Ser His Leu Lys Val Gly Ser Ala Ala Asp
1955 1960 1965 1970

ATC CCC ATC AAC ATC TCA GAG ACG GAT CTC AGC CTG CTG ACG GCC ACT 6129
Ile Pro Ile Asn Ile Ser Glu Thr Asp Leu Ser Leu Leu Thr Ala Thr
1975 1980 1985

GTG GTC CCG CCC TCG GGC CGG GAG GAG CCC TGT TTG CTG AAG CGG CTG 6177
Val Val Pro Pro Ser Gly Arg Glu Glu Pro Cys Leu Leu Lys Arg Leu
1990 1995 2000

CGT AAT GGC CAC GTG GGG ATT TCA TTC GTG CCC AAG GAG ACG GGG GAG 6225
Arg Asn Gly His Val Gly Ile Ser Phe Val Pro Lys Glu Thr Gly Glu
2005 2010 2015

CAC CTG GTG CAT GTG AAG AAA AAT GGC CAG CAC GTG GCC AGC AGC CCC 6273
His Leu Val His Val Lys Lys Asn Gly Gln His Val Ala Ser Ser Pro
2020 2025 2030

ATC CCG GTG GTG ATC AGC CAG TCG GAA ATT GGG GAT GCC AGT CGT GTT 6321
Ile Pro Val Val Ile Ser Gln Ser Glu Ile Gly Asp Ala Ser Arg Val
2035 2040 2045 2050

CGG GTC TCT GGT CAG GGC CTT CAC GAA GGC CAC ACC TTT GAG CCT GCA 6369
Arg Val Ser Gly Gln Gly Leu His Glu Gly His Thr Phe Glu Pro Ala
2055 2060 2065

GAG TTT ATC ATT GAT ACC CGC GAT GCA GGC TAT GGT GGG CTC AGC CTG 6417
Glu Phe Ile Ile Asp Thr Arg Asp Ala Gly Tyr Gly Gly Leu Ser Leu
2070 2075 2080

TCC ATT GAG GGC CCC AGC AAG GTG GAC ATC AAC ACA GAG GAC CTG GAG 6465
Ser Ile Glu Gly Pro Ser Lys Val Asp Ile Asn Thr Glu Asp Leu Glu
2085 2090 2095

GAC GGG ACG TGC AGG GTC ACC TAC TGC CCC ACA GAG CCA GGC AAC TAC 6513
Asp Gly Thr Cys Arg Val Thr Tyr Cys Pro Thr Glu Pro Gly Asn Tyr
2100 2105 2110

ATC ATC AAC ATC AAG TTT GCC GAC CAG CAC GTG CCT GGC AGC CCC TTC 6561
Ile Ile Asn Ile Lys Phe Ala Asp Gln His Val Pro Gly Ser Pro Phe
2115 2120 2125 2130

TCT GTG AAG GTG ACA GGC GAG GGC CGG GTG AAA GAG AGC ATC ACC CGC 6609
Ser Val Lys Val Thr Gly Glu Gly Arg Val Lys Glu Ser Ile Thr Arg
2135 2140 2145

AGG CGT CGG GCT CCT TCA GTG GCC AAC GTT GGT AGT CAT TGT GAC CTC 6657
Arg Arg Arg Ala Pro Ser Val Ala Asn Val Gly Ser His Cys Asp Leu
2150 2155 2160

AGC CTG AAA ATC CCT GAA ATT AGC ATC CAG GAT ATG ACA GCC CAG GTG 6705
Ser Leu Lys Ile Pro Glu Ile Ser Ile Gln Asp Met Thr Ala Gln Val
2165 2170 2175

ACC AGC CCA TCG GGC AAG ACC CAT GAG GCC GAG ATC GTG GAA GGG GAG 6753
Thr Ser Pro Ser Gly Lys Thr His Glu Ala Glu Ile Val Glu Gly Glu
2180 2185 2190

AAC CAC ACC TAC TGC ATC CGC TTT GTT CCC GCT GAG ATG GGC ACA CAC 6801
Asn His Thr Tyr Cys Ile Arg Phe Val Pro Ala Glu Met Gly Thr His
2195 2200 2205 2210

ACA GTC AGC GTC AAG TAC AAG GGC CAG CAC GTG CCT GGG AGC CCC TTC 6849
Thr Val Ser Val Lys Tyr Lys Gly Gln His Val Pro Gly Ser Pro Phe
2215 2220 2225

CAG TTC ACC GTG GGG CCC CTA GGG GAA GGG GGA GCC CAC AAG GTC CGA 6897
Gln Phe Thr Val Gly Pro Leu Gly Glu Gly Gly Ala His Lys Val Arg
2230 2235 2240

GCT GGG GGC CCT GGC CTG GAG AGA GCT GAA GCT GGA GTG CCA GCC GAA 6945
Ala Gly Gly Pro Gly Leu Glu Arg Ala Glu Ala Gly Val Pro Ala Glu
2245 2250 2255

TTC AGT ATC TGG ACC CGG GAA GCT GGT GCT GGA GGC CTG GCC ATT GCT 6993
Phe Ser Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly Leu Ala Ile Ala
2260 2265 2270

GTC GAG GGC CCC AGC AAG GCT GAG ATC TCT TTT GAG GAC CGC AAG GAC 7041
Val Glu Gly Pro Ser Lys Ala Glu Ile Ser Phe Glu Asp Arg Lys Asp
2275 2280 2285 2290

GGC TCC TGT GGT GTG GCT TAT GTG GTC CAG GAG CCA GGT GAC TAC GAA 7089
Gly Ser Cys Gly Val Ala Tyr Val Val Gln Glu Pro Gly Asp Tyr Glu
2295 2300 2305

GTC TCA GTC AAG TTC AAC GAG GAA CAC ATT CCC GAC AGC CCC TTC GTG 7137
Val Ser Val Lys Phe Asn Glu Glu His Ile Pro Asp Ser Pro Phe Val
2310 2315 2320

GTG CCT GTG GCT TCT CCG TCT GGC GAC GCC CGC CGC CTC ACT GTT TCT 7185
Val Pro Val Ala Ser Pro Ser Gly Asp Ala Arg Arg Leu Thr Val Ser
2325 2330 2335

AGC CTT CAG GAG TCA GGG CTA AAG GTC AAC CAG CCA GCC TCT TTT GCA 7233
Ser Leu Gln Glu Ser Gly Leu Lys Val Asn Gln Pro Ala Ser Phe Ala
2340 2345 2350

GTC AGC CTG AAC GGG GCC AAG GGG GCG ATC GAT GCC AAG GTG CAC AGC 7281
Val Ser Leu Asn Gly Ala Lys Gly Ala Ile Asp Ala Lys Val His Ser
2355 2360 2365 2370

CCC TCA GGA GCC CTG GAG GAG TGC TAT GTC ACA GAA ATT GAC CAA GAT 7329
Pro Ser Gly Ala Leu Glu Glu Cys Tyr Val Thr Glu Ile Asp Gln Asp
2375 2380 2385

AAG TAT GCT GTG CGC TTC ATC CCT CGG GAG AAT GGC GTT TAC CTG ATT 7377
Lys Tyr Ala Val Arg Phe Ile Pro Arg Glu Asn Gly Val Tyr Leu Ile
2390 2395 2400

GAC GTC AAG TTC AAC GGT ACC CAC ATC CCT GGA AGC CCC TTC AAG ATC 7425
Asp Val Lys Phe Asn Gly Thr His Ile Pro Gly Ser Pro Phe Lys Ile
2405 2410 2415

CGA GTT GGG GAG CCT GGG CAT GGA GGG GAC CCA GGC TTG GTG TCT GCT 7473
Arg Val Gly Glu Pro Gly His Gly Gly Asp Pro Gly Leu Val Ser Ala
2420 2425 2430

TAC GGA GCA GGT CTG GAA GGC GGT GTC ACA GGG AAC CCA GCT GAG TTC 7521
Tyr Gly Ala Gly Leu Glu Gly Gly Val Thr Gly Asn Pro Ala Glu Phe
2435 2440 2445 2450

GTC GTG AAC ACG AGC AAT GCG GGA GCT GGT GCC CTG TCG GTG ACC ATT 7569
Val Val Asn Thr Ser Asn Ala Gly Ala Gly Ala Leu Ser Val Thr Ile
2455 2460 2465

GAC GGC CCC TCC AAG GTG AAG ATG GAT TGC CAG GAG TGC CCT GAG GGC 7617
Asp Gly Pro Ser Lys Val Lys Met Asp Cys Gln Glu Cys Pro Glu Gly
2470 2475 2480

TAC CGC GTC ACC TAT ACC CCC ATG GCA CCT GGC AGC TAC CTC ATC TCC 7665
Tyr Arg Val Thr Tyr Thr Pro Met Ala Pro Gly Ser Tyr Leu Ile Ser
2485 2490 2495

ATC AAG TAC GGC GGC CCC TAC CAC ATT GGG GGC AGC CCC TTC AAG GCC 7713
Ile Lys Tyr Gly Gly Pro Tyr His Ile Gly Gly Ser Pro Phe Lys Ala
2500 2505 2510

AAA GTC ACA GGC CCC CGT CTC GTC AGC AAC CAC AGC CTC CAC GAG ACA 7761
Lys Val Thr Gly Pro Arg Leu Val Ser Asn His Ser Leu His Glu Thr
2515 2520 2525 2530

TCA TCA GTG TTT GTA GAC TCT CTG ACC AAG GCC ACC TGT GCC CCC CAG 7809
Ser Ser Val Phe Val Asp Ser Leu Thr Lys Ala Thr Cys Ala Pro Gln
2535 2540 2545

CAT GGG GCC CCG GGT CCT GGG CCT GCT GAC GCC AGC AAG GTG GTG GCC 7857
His Gly Ala Pro Gly Pro Gly Pro Ala Asp Ala Ser Lys Val Val Ala
2550 2555 2560

AAG GGC CTG GGG CTG AGC AAG GCC TAC GTA GGC CAG AAG AGC AGC TTC 7905
Lys Gly Leu Gly Leu Ser Lys Ala Tyr Val Gly Gln Lys Ser Ser Phe
2565 2570 2575

ACA GTA GAC TGC AGC AAA GCA GGC AAC AAC ATG CTG CTG GTG GGG GTT 7953
Thr Val Asp Cys Ser Lys Ala Gly Asn Asn Met Leu Leu Val Gly Val
2580 2585 2590

CAT GGC CCA AGG ACC CCC TGC GAG GAG ATC CTG GTG AAG CAC GTG GGC 8001
His Gly Pro Arg Thr Pro Cys Glu Glu Ile Leu Val Lys His Val Gly
2595 2600 2605 2610

AGC CGG CTC TAC AGC GTG TCC TAC CTG CTC AAG GAC AAG GGG GAG TAC 8049
Ser Arg Leu Tyr Ser Val Ser Tyr Leu Leu Lys Asp Lys Gly Glu Tyr
2615 2620 2625

ACA CTG GTG GTC AAA TGG GGG CAC GAG CAC ATC CCA GGC AGC CCC TAC 8097
Thr Leu Val Val Lys Trp Gly His Glu His Ile Pro Gly Ser Pro Tyr
2630 2635 2640

CGC GTT GTG GTG CCC TGAGTCTGGG GCCCGTGCCA GCCGGCAGCC CCCAAGCCTG 8152
Arg Val Val Val Pro
2645

CCCCGCTACC CAAGCAGCCC CGCCCTCTTC CCCTCAACCC CGGCCCAGGC CGCCCTGGCC 8212

GCCCGCCTGT CACTGCAGCT GCCCCTGCCC TGTGCCGTGC TGCGCTCACC TGCCTCCCCA 8272

GCCAGCCGCT GACCTCTCGG CTTTCACTTG GGCAGAGGGA GCCATTTGGT GGCGCTGCTT 8332

GTCTTCTTTG GTTCTGGGAG GGGTGAGGGA TGGGG 8367

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2647 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Ser Ser Ser His Ser Arg Ala Gly Gln Ser Ala Ala Gly Ala Ala
1 5 10 15

Pro Gly Gly Gly Val Asp Thr Arg Asp Ala Glu Met Pro Ala Thr Glu
20 25 30

Lys Asp Leu Ala Glu Asp Ala Pro Trp Lys Lys Ile Gln Gln Asn Thr
35 40 45

Phe Thr Arg Trp Cys Asn Glu His Leu Lys Cys Val Ser Lys Arg Ile
50 55 60

Ala Asn Leu Gln Thr Asp Leu Ser Asp Gly Leu Arg Leu Ile Ala Leu
65 70 75 80

Leu Glu Val Leu Ser Gln Lys Lys Met His Arg Lys His Asn Gln Arg
85 90 95

Pro Thr Phe Arg Gln Met Gln Leu Glu Asn Val Ser Val Ala Leu Glu
100 105 110

Phe Leu Asp Arg Glu Ser Ile Lys Leu Val Ser Ile Asp Ser Lys Ala
115 120 125

Ile Val Asp Gly Asn Leu Lys Leu Ile Leu Gly Leu Ile Trp Thr Leu
130 135 140

Ile Leu His Tyr Ser Ile Ser Met Pro Met Trp Asp Glu Glu Glu Asp
145 150 155 160

Glu Glu Ala Lys Lys Gln Thr Pro Lys Gln Arg Leu Leu Gly Trp Ile
165 170 175

Gln Asn Lys Leu Pro Gln Leu Pro Ile Thr Asn Phe Ser Arg Asp Trp
180 185 190

Gln Ser Gly Arg Ala Leu Gly Ala Leu Val Asp Ser Cys Ala Pro Gly
195 200 205

Leu Cys Pro Asp Trp Asp Ser Trp Asp Ala Ser Lys Pro Val Thr Asn
210 215 220

Ala Arg Glu Ala Met Gln Gln Ala Asp Asp Trp Leu Gly Ile Pro Gln
225 230 235 240

Val Ile Thr Pro Glu Glu Ile Val Asp Pro Asn Val Asp Glu His Ser
245 250 255

Val Met Thr Tyr Leu Ser Gln Phe Pro Lys Ala Lys Leu Lys Pro Gly
260 265 270

Ala Pro Leu Arg Pro Lys Leu Asn Pro Lys Lys Ala Arg Ala Tyr Gly
275 280 285

Pro Gly Ile Glu Pro Thr Gly Asn Met Val Lys Lys Arg Ala Glu Phe
290 295 300

Thr Val Glu Thr Arg Ser Ala Gly Gln Gly Glu Val Leu Val Tyr Val
305 310 315 320

Glu Asp Pro Ala Gly His Gln Glu Glu Ala Lys Val Thr Ala Asn Asn
325 330 335

Asp Lys Asn Arg Thr Phe Ser Val Trp Tyr Val Pro Glu Val Thr Gly
340 345 350

Thr His Lys Val Thr Val Leu Phe Ala Gly Gln His Ile Ala Lys Ser
355 360 365

Pro Phe Glu Val Tyr Val Asp Lys Ser Gln Gly Asp Ala Ser Lys Val
370 375 380

Thr Ala Gln Gly Pro Gly Leu Glu Pro Ser Gly Asn Ile Ala Asn Lys
385 390 395 400

Thr Thr Tyr Phe Glu Ile Phe Thr Ala Gly Ala Gly Thr Gly Glu Val
405 410 415

Glu Val Val Ile Gln Asp Pro Met Gly Gln Lys Gly Thr Val Glu Pro
420 425 430

Gln Leu Glu Ala Arg Gly Asp Ser Thr Tyr Arg Cys Ser Tyr Gln Pro
435 440 445

Thr Met Glu Gly Val His Thr Val His Val Thr Phe Ala Gly Val Pro
450 455 460

Ile Pro Arg Ser Pro Tyr Thr Val Thr Val Gly Gln Ala Cys Asn Pro
465 470 475 480

Ser Ala Cys Arg Ala Val Gly Arg Gly Leu Gln Pro Lys Gly Val Arg
485 490 495

Val Lys Glu Thr Ala Asp Phe Lys Val Tyr Thr Lys Gly Ala Gly Ser
500 505 510

Gly Glu Leu Lys Val Thr Val Lys Gly Pro Lys Gly Glu Glu Arg Val
515 520 525

Lys Gln Lys Asp Leu Gly Asp Gly Val Tyr Gly Phe Glu Tyr Tyr Pro
530 535 540

Met Val Pro Gly Thr Tyr Ile Val Thr Ile Thr Trp Gly Gly Gln Asn
545 550 555 560



Ile Gly Arg Ser Pro Phe Glu Val Lys Val Gly Thr Glu Cys Gly Asn
565 570 575

Gln Lys Val Arg Ala Trp Gly Pro Gly Leu Glu Gly Gly Val Val Gly
580 585 590

Lys Ser Ala Asp Phe Val Val Glu Ala Ile Gly Asp Asp Val Gly Thr
595 600 605

Leu Gly Phe Ser Val Glu Gly Pro Ser Gln Ala Lys Ile Glu Cys Asp
610 615 620

Asp Lys Gly Asp Gly Ser Cys Asp Val Arg Tyr Trp Pro Gln Glu Ala
625 630 635 640

Gly Glu Tyr Ala Val His Val Leu Cys Asn Ser Glu Asp Ile Arg Leu
645 650 655

Ser Pro Phe Met Ala Asp Ile Arg Asp Ala Pro Gln Asp Phe His Pro
660 665 670

Asp Arg Val Lys Ala Arg Gly Pro Gly Leu Glu Lys Thr Gly Val Ala
675 680 685

Val Asn Lys Pro Ala Glu Phe Thr Val Asp Ala Lys His Gly Gly Lys
690 695 700

Ala Pro Leu Arg Val Gln Val Gln Asp Asn Glu Gly Cys Pro Val Glu
705 710 715 720

Ala Leu Val Lys Asp Asn Gly Asn Gly Thr Tyr Ser Cys Ser Tyr Val
725 730 735

Pro Arg Lys Pro Val Lys His Thr Ala Met Val Ser Trp Gly Gly Val
740 745 750

Ser Ile Pro Asn Ser Pro Phe Arg Val Asn Val Gly Ala Gly Ser His
755 760 765

Pro Asn Lys Val Lys Val Tyr Gly Pro Gly Val Ala Lys Thr Gly Leu
770 775 780

Lys Ala His Glu Pro Thr Tyr Phe Thr Val Asp Cys Ala Glu Ala Gly
785 790 795 800

Gln Gly Asp Val Ser Ile Gly Ile Lys Cys Ala Pro Gly Val Val Gly
805 810 815

Pro Ala Glu Ala Asp Ile Asp Phe Asp Ile Ile Arg Asn Asp Asn Asp
820 825 830

Thr Phe Thr Val Lys Tyr Thr Pro Arg Gly Ala Gly Ser Tyr Thr Ile
835 840 845

Met Val Leu Phe Ala Asp Gln Ala Thr Pro Thr Ser Pro Ile Arg Val
850 855 860

Lys Val Glu Pro Ser His Asp Ala Ser Lys Val Lys Ala Glu Gly Pro
865 870 875 880

Gly Leu Ser Arg Thr Gly Val Glu Leu Gly Lys Pro Thr His Phe Thr
885 890 895

Val Asn Ala Lys Ala Ala Gly Lys Gly Lys Leu Asp Val Gln Phe Ser
900 905 910

Gly Leu Thr Lys Gly Asp Ala Val Arg Asp Val Asp Ile Ile Asp His
915 920 925

His Asp Asn Thr Tyr Thr Val Lys Tyr Thr Pro Val Gln Gln Gly Pro
930 935 940

Val Gly Val Asn Val Thr Tyr Gly Gly Asp Pro Ile Pro Lys Ser Pro
945 950 955 960

Phe Ser Val Ala Val Ser Pro Ser Leu Asp Leu Ser Lys Ile Lys Val
965 970 975

Ser Gly Leu Gly Glu Lys Val Asp Val Gly Lys Asp Gln Glu Phe Thr
980 985 990

Val Lys Ser Lys Gly Ala Gly Gly Gln Gly Lys Val Ala Ser Lys Ile
995 1000 1005

Val Gly Pro Ser Gly Ala Ala Val Pro Cys Lys Val Glu Pro Gly Leu
1010 1015 1020

Gly Ala Asp Asn Ser Val Val Arg Phe Leu Pro Arg Glu Glu Gly Pro
1025 1030 1035 1040

Tyr Glu Val Glu Val Thr Tyr Asp Gly Val Pro Val Pro Gly Ser Pro
1045 1050 1055

Phe Pro Leu Glu Ala Val Ala Pro Thr Lys Pro Ser Lys Val Lys Ala
1060 1065 1070

Phe Gly Pro Gly Leu Gln Gly Gly Ser Ala Gly Ser Pro Ala Arg Phe
1075 1080 1085

Thr Ile Asp Thr Lys Gly Ala Gly Thr Gly Gly Leu Gly Leu Thr Val
1090 1095 1100

Glu Gly Pro Cys Glu Ala Gln Leu Glu Cys Leu Asp Asn Gly Asp Gly
1105 1110 1115 1120

Thr Cys Ser Val Ser Tyr Val Pro Thr Glu Pro Gly Asp Tyr Asn Ile
1125 1130 1135

Asn Ile Leu Phe Ala Asp Thr His Ile Pro Gly Ser Pro Phe Lys Ala
1140 1145 1150

His Val Val Pro Cys Phe Asp Ala Ser Lys Val Lys Cys Ser Gly Pro
1155 1160 1165

Gly Leu Glu Arg Ala Thr Ala Gly Glu Val Gly Gln Phe Gln Val Asp
1170 1175 1180

Cys Ser Ser Ala Gly Ser Ala Glu Leu Thr Ile Glu Ile Cys Ser Glu
1185 1190 1195 1200

Ala Gly Leu Pro Ala Glu Val Tyr Ile Gln Asp His Gly Asp Gly Thr
1205 1210 1215

His Thr Ile Thr Tyr Ile Pro Leu Cys Pro Gly Ala Tyr Thr Val Thr
1220 1225 1230

Ile Lys Tyr Gly Gly Gln Pro Val Pro Asn Phe Pro Ser Lys Leu Gln
1235 1240 1245

Val Glu Pro Ala Val Asp Thr Ser Gly Val Gln Cys Tyr Gly Pro Gly
1250 1255 1260

Ile Glu Gly Gln Gly Val Phe Arg Glu Ala Thr Thr Glu Phe Ser Val
1265 1270 1275 1280

Asp Ala Arg Ala Leu Thr Gln Thr Gly Gly Pro His Val Lys Ala Arg
1285 1290 1295

Val Ala Asn Pro Ser Gly Asn Leu Thr Glu Thr Tyr Val Gln Asp Arg
1300 1305 1310

Gly Asp Gly Met Tyr Lys Val Glu Tyr Thr Pro Tyr Glu Glu Gly Leu
1315 1320 1325

His Ser Val Asp Val Thr Tyr Asp Gly Ser Pro Val Pro Ser Ser Pro
1330 1335 1340

Phe Gln Val Pro Val Thr Glu Gly Cys Asp Pro Ser Arg Val Arg Val
1345 1350 1355 1360

His Gly Pro Gly Ile Gln Ser Gly Thr Thr Asn Lys Pro Asn Lys Phe
1365 1370 1375

Thr Val Glu Thr Arg Gly Ala Gly Thr Gly Gly Leu Gly Leu Ala Val
1380 1385 1390

Glu Gly Pro Ser Glu Ala Lys Met Ser Cys Met Asp Asn Lys Asp Gly
1395 1400 1405

Ser Cys Ser Val Glu Tyr Ile Pro Tyr Glu Ala Gly Thr Tyr Ser Leu
1410 1415 1420

Asn Val Thr Tyr Gly Gly His Gln Val Pro Gly Ser Pro Phe Lys Val
1425 1430 1435 1440

Pro Val His Asp Val Thr Asp Ala Ser Lys Val Lys Cys Ser Gly Pro
1445 1450 1455

Gly Leu Ser Pro Gly Met Val Arg Ala Asn Leu Pro Gln Ser Phe Gln
1460 1465 1470

Val Asp Thr Ser Lys Ala Gly Val Ala Pro Leu Gln Val Lys Val Gln
1475 1480 1485

Gly Pro Lys Gly Leu Val Glu Pro Val Asp Val Val Asp Asn Ala Asp
1490 1495 1500

Gly Thr Gln Thr Val Asn Tyr Val Pro Ser Arg Glu Gly Pro Tyr Ser
1505 1510 1515 1520

Ile Ser Val Leu Tyr Gly Asp Glu Glu Val Pro Arg Ser Pro Phe Lys
1525 1530 1535

Val Lys Val Leu Pro Thr His Asp Ala Ser Lys Val Lys Ala Ser Gly
1540 1545 1550

Pro Gly Leu Asn Thr Thr Gly Val Pro Ala Ser Leu Pro Val Glu Phe
1555 1560 1565

Thr Ile Asp Ala Lys Asp Ala Gly Glu Gly Leu Leu Ala Val Gln Ile
1570 1575 1580

Thr Asp Pro Glu Gly Lys Pro Lys Lys Thr His Ile Gln Asp Asn His
1585 1590 1595 1600

Asp Gly Thr Tyr Thr Val Ala Tyr Val Pro Asp Val Thr Gly Arg Tyr
1605 1610 1615

Thr Ile Leu Ile Lys Tyr Gly Gly Asp Glu Ile Pro Phe Ser Pro Tyr
1620 1625 1630

Arg Val Arg Ala Val Pro Thr Gly Asp Ala Ser Lys Cys Thr Val Thr
1635 1640 1645






WO 97/25423 A PCT/US97/00100 


Val Ser Ile Gly Gly His Gly Leu Gly Ala Gly Ile Gly Pro Thr Ile
1650 1655 1660

Gln Ile Gly Glu Glu Thr Val Ile Thr Val Asp Thr Lys Ala Ala Gly
1665 1670 1675 1680

Lys Gly Lys Val Thr Cys Thr Val Cys Thr Pro Asp Gly Ser Glu Val
1685 1690 1695

Asp Val Asp Val Val Glu Asn Glu Asp Gly Thr Phe Asp Ile Phe Tyr
1700 1705 1710

Thr Ala Pro Gln Pro Gly Lys Tyr Val Ile Cys Val Arg Phe Gly Gly
1715 1720 1725

Glu His Val Pro Asn Ser Pro Phe Gln Val Thr Ala Leu Ala Gly Asp
1730 1735 1740

Gln Pro Ser Val Gln Pro Pro Leu Arg Ser Gln Gln Leu Ala Pro Gln
1745 1750 1755 1760

Tyr Thr Tyr Ala Gln Gly Gly Gln Gln Thr Trp Ala Pro Glu Arg Pro
1765 1770 1775

Leu Val Gly Val Asn Gly Leu Asp Val Thr Ser Leu Arg Pro Phe Asp
1780 1785 1790

Leu Val Ile Pro Phe Thr Ile Lys Lys Gly Glu Ile Thr Gly Glu Val
1795 1800 1805

Arg Met Pro Ser Gly Lys Val Ala Gln Pro Thr Ile Thr Asp Asn Lys
1810 1815 1820

Asp Gly Thr Val Thr Val Arg Tyr Ala Pro Ser Glu Ala Gly Leu His
1825 1830 1835 1840

Glu Met Asp Ile Arg Tyr Asp Asn Met His Ile Pro Gly Ser Pro Leu
1845 1850 1855

Gln Phe Tyr Val Asp Tyr Val Asn Cys Gly His Val Thr Ala Tyr Gly
1860 1865 1870

Pro Gly Leu Thr His Gly Val Val Asn Lys Pro Ala Thr Phe Thr Val
1875 1880 1885

Asn Thr Lys Asp Ala Gly Glu Gly Gly Leu Ser Leu Ala Ile Glu Gly
1890 1895 1900

Pro Ser Lys Ala Glu Ile Ser Cys Thr Asp Asn Gln Asp Gly Thr Cys
1905 1910 1915 1920

Ser Val Ser Tyr Leu Pro Val Leu Pro Gly Asp Tyr Ser Ile Leu Val
1925 1930 1935

Lys Tyr Asn Glu Gln His Val Pro Gly Ser Pro Phe Thr Ala Arg Val
1940 1945 1950

Thr Gly Asp Asp Ser Met Arg Met Ser His Leu Lys Val Gly Ser Ala
1955 1960 1965

Ala Asp Ile Pro Ile Asn Ile Ser Glu Thr Asp Leu Ser Leu Leu Thr
1970 1975 1980

Ala Thr Val Val Pro Pro Ser Gly Arg Glu Glu Pro Cys Leu Leu Lys
1985 1990 1995 2000

Arg Leu Arg Asn Gly His Val Gly Ile Ser Phe Val Pro Lys Glu Thr
2005 2010 2015

Gly Glu His Leu Val His Val Lys Lys Asn Gly Gln His Val Ala Ser
2020 2025 2030

Ser Pro Ile Pro Val Val Ile Ser Gln Ser Glu Ile Gly Asp Ala Ser
2035 2040 2045

Arg Val Arg Val Ser Gly Gln Gly Leu His Glu Gly His Thr Phe Glu
2050 2055 2060

Pro Ala Glu Phe Ile Ile Asp Thr Arg Asp Ala Gly Tyr Gly Gly Leu
2065 2070 2075 2080

Ser Leu Ser Ile Glu Gly Pro Ser Lys Val Asp Ile Asn Thr Glu Asp
2085 2090 2095

Leu Glu Asp Gly Thr Cys Arg Val Thr Tyr Cys Pro Thr Glu Pro Gly
2100 2105 2110

Asn Tyr Ile Ile Asn Ile Lys Phe Ala Asp Gln His Val Pro Gly Ser
2115 2120 2125

Pro Phe Ser Val Lys Val Thr Gly Glu Gly Arg Val Lys Glu Ser Ile
2130 2135 2140

Thr Arg Arg Arg Arg Ala Pro Ser Val Ala Asn Val Gly Ser His Cys
2145 2150 2155 2160

Asp Leu Ser Leu Lys Ile Pro Glu Ile Ser Ile Gln Asp Met Thr Ala
2165 2170 2175

Gln Val Thr Ser Pro Ser Gly Lys Thr His Glu Ala Glu Ile Val Glu
2180 2185 2190

Gly Glu Asn His Thr Tyr Cys Ile Arg Phe Val Pro Ala Glu Met Gly
2195 2200 2205

Thr His Thr Val Ser Val Lys Tyr Lys Gly Gln His Val Pro Gly Ser
2210 2215 2220

Pro Phe Gln Phe Thr Val Gly Pro Leu Gly Glu Gly Gly Ala His Lys
2225 2230 2235 2240

Val Arg Ala Gly Gly Pro Gly Leu Glu Arg Ala Glu Ala Gly Val Pro
2245 2250 2255

Ala Glu Phe Ser Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly Leu Ala
2260 2265 2270

Ile Ala Val Glu Gly Pro Ser Lys Ala Glu Ile Ser Phe Glu Asp Arg
2275 2280 2285

Lys Asp Gly Ser Cys Gly Val Ala Tyr Val Val Gln Glu Pro Gly Asp
2290 2295 2300

Tyr Glu Val Ser Val Lys Phe Asn Glu Glu His Ile Pro Asp Ser Pro
2305 2310 2315 2320

Phe Val Val Pro Val Ala Ser Pro Ser Gly Asp Ala Arg Arg Leu Thr
2325 2330 2335

Val Ser Ser Leu Gln Glu Ser Gly Leu Lys Val Asn Gln Pro Ala Ser
2340 2345 2350

Phe Ala Val Ser Leu Asn Gly Ala Lys Gly Ala Ile Asp Ala Lys Val
2355 2360 2365

His Ser Pro Ser Gly Ala Leu Glu Glu Cys Tyr Val Thr Glu Ile Asp
2370 2375 2380

Gln Asp Lys Tyr Ala Val Arg Phe Ile Pro Arg Glu Asn Gly Val Tyr
2385 2390 2395 2400

Leu Ile Asp Val Lys Phe Asn Gly Thr His Ile Pro Gly Ser Pro Phe
2405 2410 2415

Lys Ile Arg Val Gly Glu Pro Gly His Gly Gly Asp Pro Gly Leu Val
2420 2425 2430

Ser Ala Tyr Gly Ala Gly Leu Glu Gly Gly Val Thr Gly Asn Pro Ala
2435 2440 2445

Glu Phe Val Val Asn Thr Ser Asn Ala Gly Ala Gly Ala Leu Ser Val
2450 2455 2460

Thr Ile Asp Gly Pro Ser Lys Val Lys Met Asp Cys Gln Glu Cys Pro
2465 2470 2475 2480

Glu Gly Tyr Arg Val Thr Tyr Thr Pro Met Ala Pro Gly Ser Tyr Leu
2485 2490 2495

Ile Ser Ile Lys Tyr Gly Gly Pro Tyr His Ile Gly Gly Ser Pro Phe
2500 2505 2510

Lys Ala Lys Val Thr Gly Pro Arg Leu Val Ser Asn His Ser Leu His
2515 2520 2525

Glu Thr Ser Ser Val Phe Val Asp Ser Leu Thr Lys Ala Thr Cys Ala
2530 2535 2540

Pro Gln His Gly Ala Pro Gly Pro Gly Pro Ala Asp Ala Ser Lys Val
2545 2550 2555 2560

Val Ala Lys Gly Leu Gly Leu Ser Lys Ala Tyr Val Gly Gln Lys Ser
2565 2570 2575

Ser Phe Thr Val Asp Cys Ser Lys Ala Gly Asn Asn Met Leu Leu Val
2580 2585 2590

Gly Val His Gly Pro Arg Thr Pro Cys Glu Glu Ile Leu Val Lys His
2595 2600 2605

Val Gly Ser Arg Leu Tyr Ser Val Ser Tyr Leu Leu Lys Asp Lys Gly
2610 2615 2620

Glu Tyr Thr Leu Val Val Lys Trp Gly His Glu His Ile Pro Gly Ser
2625 2630 2635 2640

Pro Tyr Arg Val Val Val Pro
2645



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1125 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1125

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

TTC GAG ATG TCT GAC TTC ATC GTG GAC ACA AGG GAT GCA GGT TAT GGT 48
Phe Glu Met Ser Asp Phe Ile Val Asp Thr Arg Asp Ala Gly Tyr Gly
1 5 10 15

GGC ATA TCC TTG GCG GTG GAA GGC CCC AGC AAA GTG GAC ATC CAG ACG 96
Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Lys Val Asp Ile Gln Thr
20 25 30

GAG GAC CTG GAA GAT GGC ACC TGC AAA GTC TCC TAC TTC CCT ACC GTG 144
Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Ser Tyr Phe Pro Thr Val
35 40 45

CCT GGG GTT TAT ATC GTC TCC ACC AAA TTC GCT GAC GAG CAC GTG CCT 192
Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Ala Asp Glu His Val Pro
50 55 60

GGG AGC CCA TTT ACC GTG AAG ATC AGT GGG GAG GGA AGA GTC AAA GAG 240
Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Glu Gly Arg Val Lys Glu
65 70 75 80

AGC ATC ACC CGC ACC AGT CGG GCC CCG TCC GTG GCC ACT GTC GGG AGC 288
Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Val Ala Thr Val Gly Ser
85 90 95

ATT TGT GAC CTG AAC CTC AAA ATC CCA GAA ATC AAC AGC AGT GAT ATG 336
Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Ile Asn Ser Ser Asp Met
100 105 110

TCG GCC CAC GTC ACC AGC CCC TCT GGC CGT GTG ACT GAG GCA GAG ATT 384
Ser Ala His Val Thr Ser Pro Ser Gly Arg Val Thr Glu Ala Glu Ile
115 120 125

GTG CCC ATG GGG AAG AAC TCA CAC TGC GTC CGG TTT GTG CCC CAG GAG 432
Val Pro Met Gly Lys Asn Ser His Cys Val Arg Phe Val Pro Gln Glu
130 135 140

ATG GGC GTG CAC ACG GTC AGC GTC AAG TAC CGT GGG CAG CAC GTC ACC 480
Met Gly Val His Thr Val Ser Val Lys Tyr Arg Gly Gln His Val Thr
145 150 155 160

GGC AGC CCC TTC CAG TTC ACC GTG GGG GCA CTT GGT GAA GGA GGC GCC 528
Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Leu Gly Glu Gly Gly Ala
165 170 175

CAC AAG GTG CGG GCA GGA GGC CCT GGC CTG GAG AGA GGA GAA GCG GGA 576
His Lys Val Arg Ala Gly Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly
180 185 190

GTC CCA GCT GAG TTC AGC ATT TGG ACC CGG GAA GCA GGC GCT GGA GGC 624
Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly
195 200 205

CTC TCC ATC GCT GTT GAG GGC CCC AGT AAG GCC GAG ATT ACA TTC GAT 672
Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp
210 215 220

GAC CAT AAA AAT GGG TCG TGC GGT GTA TCT TAT ATT GCC CAA GAG CCT 720
Asp His Lys Asn Gly Ser Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro
225 230 235 240

GGT AAC TAC GAG GTG TCC ATC AAG TTC AAT GAT GAG CAC ATC CCG GAA 768
Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn Asp Glu His Ile Pro Glu
245 250 255

AGC CCC TAC CTG GTG CCG GTC ATC GCA CCC TCC GAC GAC GCC CGC CGC 816
Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Ser Asp Asp Ala Arg Arg
260 265 270

CTC ACT GTT ATG AGC CTT CAG GAA TCG GGA TTA AAA GTT AAC CAG CCA 864
Leu Thr Val Met Ser Leu Gln Glu Ser Gly Leu Lys Val Asn Gln Pro
275 280 285

GCA TCC TTT GCT ATA AGG TTG AAT GGC GCA AAA GGC AAG ATT GAT GCA 912
Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala
290 295 300

AAG GTG CAC AGC CCC TCT GGA GCC GTG GAG GAG TGC CAC GTG TCT GAG 960
Lys Val His Ser Pro Ser Gly Ala Val Glu Glu Cys His Val Ser Glu
305 310 315 320

CTG GAG CCA GAT AAG TAT GCT GTT CGC TTC ATC CCT CAT GAG AAT GGT 1008
Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Ile Pro His Glu Asn Gly
325 330 335

GTC CAC ACC ATC GAT GTC AAG TTC AAT GGG AGC CAC GTG GTT GGA AGC 1056
Val His Thr Ile Asp Val Lys Phe Asn Gly Ser His Val Val Gly Ser
340 345 350

CCC TTC AAA GTG CGC GTT GGG GAG CCT GGA CAA GCG GGG AAC CCT GCC 1104
Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala
355 360 365

CTG GTG TCC GCC TAT GGC ACG 1125
Leu Val Ser Ala Tyr Gly Thr
370 375

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Phe Glu Met Ser Asp Phe Ile Val Asp Thr Arg Asp Ala Gly Tyr Gly
1 5 10 15

Gly Ile Ser Leu Ala Val Glu Gly Pro Ser Lys Val Asp Ile Gln Thr
20 25 30

Glu Asp Leu Glu Asp Gly Thr Cys Lys Val Ser Tyr Phe Pro Thr Val
35 40 45

Pro Gly Val Tyr Ile Val Ser Thr Lys Phe Ala Asp Glu His Val Pro
50 55 60

Gly Ser Pro Phe Thr Val Lys Ile Ser Gly Glu Gly Arg Val Lys Glu
65 70 75 80

Ser Ile Thr Arg Thr Ser Arg Ala Pro Ser Val Ala Thr Val Gly Ser
85 90 95

Ile Cys Asp Leu Asn Leu Lys Ile Pro Glu Ile Asn Ser Ser Asp Met
100 105 110

Ser Ala His Val Thr Ser Pro Ser Gly Arg Val Thr Glu Ala Glu Ile
115 120 125

Val Pro Met Gly Lys Asn Ser His Cys Val Arg Phe Val Pro Gln Glu
130 135 140

Met Gly Val His Thr Val Ser Val Lys Tyr Arg Gly Gln His Val Thr
145 150 155 160

Gly Ser Pro Phe Gln Phe Thr Val Gly Ala Leu Gly Glu Gly Gly Ala
165 170 175

His Lys Val Arg Ala Gly Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly
180 185 190

Val Pro Ala Glu Phe Ser Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly
195 200 205

Leu Ser Ile Ala Val Glu Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp
210 215 220

Asp His Lys Asn Gly Ser Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro
225 230 235 240

Gly Asn Tyr Glu Val Ser Ile Lys Phe Asn Asp Glu His Ile Pro Glu
245 250 255

Ser Pro Tyr Leu Val Pro Val Ile Ala Pro Ser Asp Asp Ala Arg Arg
260 265 270

Leu Thr Val Met Ser Leu Gln Glu Ser Gly Leu Lys Val Asn Gln Pro
275 280 285

Ala Ser Phe Ala Ile Arg Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala
290 295 300

Lys Val His Ser Pro Ser Gly Ala Val Glu Glu Cys His Val Ser Glu
305 310 315 320

Leu Glu Pro Asp Lys Tyr Ala Val Arg Phe Ile Pro His Glu Asn Gly
325 330 335

Val His Thr Ile Asp Val Lys Phe Asn Gly Ser His Val Val Gly Ser
340 345 350

Pro Phe Lys Val Arg Val Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala
355 360 365

Leu Val Ser Ala Tyr Gly Thr
370 375



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1494 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1449

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

AAA ATC CCA GAA ATC AAC AGC AGT GAT ATG TCG GCC CAC GTC ACC AGC 48
Lys Ile Pro Glu Ile Asn Ser Ser Asp Met Ser Ala His Val Thr Ser
1 5 10 15

CCC TCT GGC CGT GTG ACT GAG GCA GAG ATT GTG CCC ATG GGG AAG AAC 96
Pro Ser Gly Arg Val Thr Glu Ala Glu Ile Val Pro Met Gly Lys Asn
20 25 30

TCA CAC TGC GTC CGG TTT GTG CCC CAG GAG ATG GGC GTG CAC ACG GTC 144
Ser His Cys Val Arg Phe Val Pro Gln Glu Met Gly Val His Thr Val
35 40 45

AGC GTC AAG TAC CGT GGG CAG CAC GTC ACC GGC AGC CCC TTC CAG TTC 192
Ser Val Lys Tyr Arg Gly Gln His Val Thr Gly Ser Pro Phe Gln Phe
50 55 60

ACC GTG GGG GCA CTT GGT GAA GGA GGC GCC CAC AAG GTG CGG GCA GGA 240
Thr Val Gly Ala Leu Gly Glu Gly Gly Ala His Lys Val Arg Ala Gly
65 70 75 80

GGC CCT GGC CTG GAG AGA GGA GAA GCG GGA GTC CCA GCT GAG TTC AGC 288
Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly Val Pro Ala Glu Phe Ser
85 90 95

ATT TGG ACC CGG GAA GCA GGC GCT GGA GGC CTC TCC ATC GCT GTT GAG 336
Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly Leu Ser Ile Ala Val Glu
100 105 110

GGC CCC AGT AAG GCC GAG ATT ACA TTC GAT GAC CAT AAA AAT GGG TCG 384
Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp Asp His Lys Asn Gly Ser
115 120 125

TGC GGT GTA TCT TAT ATT GCC CAA GAG CCT GGT AAC TAC GAG GTG TCC 432
Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro Gly Asn Tyr Glu Val Ser
130 135 140

ATC AAG TTC AAT GAT GAG CAC ATC CCG GAA AGC CCC TAC CTG GTG CCG 480
Ile Lys Phe Asn Asp Glu His Ile Pro Glu Ser Pro Tyr Leu Val Pro
145 150 155 160

GTC ATC GCA CCC TCC GAC GAC GCC CGC CGC CTC ACT GTT ATG AGC CTT 528
Val Ile Ala Pro Ser Asp Asp Ala Arg Arg Leu Thr Val Met Ser Leu
165 170 175

CAG GAA TCG GGA TTA AAA GTT AAC CAG CCA GCA TCC TTT GCT ATA AGG 576
Gln Glu Ser Gly Leu Lys Val Asn Gln Pro Ala Ser Phe Ala Ile Arg
180 185 190

TTG AAT GGC GCA AAA GGC AAG ATT GAT GCA AAG GTG CAC AGC CCC TCT 624
Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala Lys Val His Ser Pro Ser
195 200 205

GGA GCC GTG GAG GAG TGC CAC GTG TCT GAG CTG GAG CCA GAT AAG TAT 672
Gly Ala Val Glu Glu Cys His Val Ser Glu Leu Glu Pro Asp Lys Tyr
210 215 220

GCT GTT CGC TTC ATC CCT CAT GAG AAT GGT GTC CAC ACC ATC GAT GTC 720
Ala Val Arg Phe Ile Pro His Glu Asn Gly Val His Thr Ile Asp Val
225 230 235 240

AAG TTC AAT GGG AGC CAC GTG GTT GGA AGC CCC TTC AAA GTG CGC GTT 768

Lys Phe Asn Gly Ser His Val Val Gly Ser Pro Phe Lys Val Arg Val
245 250 255

GGG GAG CCT GGA CAA GCG GGG AAC CCT GCC CTG GTG TCC GCC TAT GGC 816
Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala Leu Val Ser Ala Tyr Gly
260 265 270

ACG GGA CTC GAA GGG GGN ACC ACA GGT ATC CAG TCG GAA TTC TTT ATT 864
Thr Gly Leu Glu Gly Xaa Thr Thr Gly Ile Gln Ser Glu Phe Phe Ile
275 280 285

AAC ACC ACC CGA GCA GGT CCA GGG ACA TTA TCC GTC ACC ATC GAA GGC 912
Asn Thr Thr Arg Ala Gly Pro Gly Thr Leu Ser Val Thr Ile Glu Gly
290 295 300

CCA TCC AAG GTT AAA ATG GAT TGC CAG GAA ACA CCT GAA GGG TAC AAA 960
Pro Ser Lys Val Lys Met Asp Cys Gln Glu Thr Pro Glu Gly Tyr Lys
305 310 315 320

GTC ATG TAC ACC CCC ATG GCT CCT GGT AAC TAC CTG ATC AGT GTC AAA 1008
Val Met Tyr Thr Pro Met Ala Pro Gly Asn Tyr Leu Ile Ser Val Lys
325 330 335

TAC GGT GGG CCC AAC CAC ATC GTG GGC AGT CCC TTC AAG GCC AAG GTG 1056
Tyr Gly Gly Pro Asn His Ile Val Gly Ser Pro Phe Lys Ala Lys Val
340 345 350

ACT GGC CAG CGT CTA GTT AGC CCT GGC TCA GCC AAC GAG ACC TCA TCC 1104
Thr Gly Gln Arg Leu Val Ser Pro Gly Ser Ala Asn Glu Thr Ser Ser
355 360 365

ATC CTG GTG GAG TCA GTG ACC AGG TCG TCT ACA GAG ACC TGC TAT AGC 1152
Ile Leu Val Glu Ser Val Thr Arg Ser Ser Thr Glu Thr Cys Tyr Ser
370 375 380

GCC ATT CCC AAG GCA TCC TCG GAC GCC AGC AAG GTG ACC TCT AAG GGG 1200
Ala Ile Pro Lys Ala Ser Ser Asp Ala Ser Lys Val Thr Ser Lys Gly
385 390 395 400

GCA GGG CTC TCA AAG GCC TTT GTG GGC CAG AAG AGT TCC TTC CTG GTG 1248
Ala Gly Leu Ser Lys Ala Phe Val Gly Gln Lys Ser Ser Phe Leu Val
405 410 415

GAC TGC AGC AAA GCT GGC TCC AAC ATG CTG CTG ATC GGG GTC CAT GGG 1296
Asp Cys Ser Lys Ala Gly Ser Asn Met Leu Leu Ile Gly Val His Gly
420 425 430

CCC ACC ACC CCC TGC GAG GAG GTC TCC ATG AAG CAT GTA GGC AAC CAG 1344
Pro Thr Thr Pro Cys Glu Glu Val Ser Met Lys His Val Gly Asn Gln
435 440 445

CAA TAC AAC GTC ACA TAC GTC GTC AAG GAG AGG GGC GAT TAT GTG CTG 1392
Gln Tyr Asn Val Thr Tyr Val Val Lys Glu Arg Gly Asp Tyr Val Leu
450 455 460

GCT GTG AAG TGG GGG GAG GAA CAC ATC CCT GGC AGC CCT TTT CAT GTC 1440
Ala Val Lys Trp Gly Glu Glu His Ile Pro Gly Ser Pro Phe His Val
465 470 475 480

ACA GTG CCT TAAAACAGTT TTCTCAAATC CTGGAAAAAA AAAAAAAAAA AAAAA 1494
Thr Val Pro

(2) INFORMATION FOR SEQ ID NO: 12:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Lys Ile Pro Glu Ile Asn Ser Ser Asp Met Ser Ala His Val Thr Ser
1 5 10 15

Pro Ser Gly Arg Val Thr Glu Ala Glu Ile Val Pro Met Gly Lys Asn
20 25 30

Ser His Cys Val Arg Phe Val Pro Gln Glu Met Gly Val His Thr Val
35 40 45

Ser Val Lys Tyr Arg Gly Gln His Val Thr Gly Ser Pro Phe Gln Phe
50 55 60

Thr Val Gly Ala Leu Gly Glu Gly Gly Ala His Lys Val Arg Ala Gly
65 70 75 80

Gly Pro Gly Leu Glu Arg Gly Glu Ala Gly Val Pro Ala Glu Phe Ser
85 90 95

Ile Trp Thr Arg Glu Ala Gly Ala Gly Gly Leu Ser Ile Ala Val Glu
100 105 110

Gly Pro Ser Lys Ala Glu Ile Thr Phe Asp Asp His Lys Asn Gly Ser
115 120 125

Cys Gly Val Ser Tyr Ile Ala Gln Glu Pro Gly Asn Tyr Glu Val Ser
130 135 140

Ile Lys Phe Asn Asp Glu His Ile Pro Glu Ser Pro Tyr Leu Val Pro
145 150 155 160

Val Ile Ala Pro Ser Asp Asp Ala Arg Arg Leu Thr Val Met Ser Leu
165 170 175

Gln Glu Ser Gly Leu Lys Val Asn Gln Pro Ala Ser Phe Ala Ile Arg
180 185 190

Leu Asn Gly Ala Lys Gly Lys Ile Asp Ala Lys Val His Ser Pro Ser
195 200 205

Gly Ala Val Glu Glu Cys His Val Ser Glu Leu Glu Pro Asp Lys Tyr
210 215 220

Ala Val Arg Phe Ile Pro His Glu Asn Gly Val His Thr Ile Asp Val
225 230 235 240

Lys Phe Asn Gly Ser His Val Val Gly Ser Pro Phe Lys Val Arg Val
245 250 255

Gly Glu Pro Gly Gln Ala Gly Asn Pro Ala Leu Val Ser Ala Tyr Gly
260 265 270

Thr Gly Leu Glu Gly Xaa Thr Thr Gly Ile Gln Ser Glu Phe Phe Ile
275 280 285

Asn Thr Thr Arg Ala Gly Pro Gly Thr Leu Ser Val Thr Ile Glu Gly
290 295 300




Pro Ser Lys Val Lys Met Asp Cys Gln Glu Thr Pro Glu Gly Tyr Lys
305 310 315 320

Val Met Tyr Thr Pro Met Ala Pro Gly Asn Tyr Leu Ile Ser Val Lys
325 330 335

Tyr Gly Gly Pro Asn His Ile Val Gly Ser Pro Phe Lys Ala Lys Val
340 345 350

Thr Gly Gln Arg Leu Val Ser Pro Gly Ser Ala Asn Glu Thr Ser Ser
355 360 365

Ile Leu Val Glu Ser Val Thr Arg Ser Ser Thr Glu Thr Cys Tyr Ser
370 375 380

Ala Ile Pro Lys Ala Ser Ser Asp Ala Ser Lys Val Thr Ser Lys Gly
385 390 395 400

Ala Gly Leu Ser Lys Ala Phe Val Gly Gln Lys Ser Ser Phe Leu Val
405 410 415

Asp Cys Ser Lys Ala Gly Ser Asn Met Leu Leu Ile Gly Val His Gly
420 425 430

Pro Thr Thr Pro Cys Glu Glu Val Ser Met Lys His Val Gly Asn Gln
435 440 445

Gln Tyr Asn Val Thr Tyr Val Val Lys Glu Arg Gly Asp Tyr Val Leu
450 455 460

Ala Val Lys Trp Gly Glu Glu His Ile Pro Gly Ser Pro Phe His Val
465 470 475 480

Thr Val Pro

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala
35 40 45

Asp Ser Pro Thr Leu
50

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro
20 25 30

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala
35

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 43 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro
35 40

(2) INFORMATION FOR SEQ ID NO: 17: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15 

Glu Lys Glu 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GATGGCACTT TTGTACTAAG GATTACTGTC CTG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
ATTGATGGTG GTCGTCTAGG CACTTTTGTA GAG 
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GTCTGCCTCT TGAAACTAAG GATTGATGGT GGT 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CCAGTTGAGT TGTTGCTACT CCTTCTCAAA GCG 33

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 34 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GTTGCTGCTC CTTCTCCTAG CGACTGTATT CCCG 34 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr Arg Leu Ala Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe Glu
1 5 10 15

Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu Tyr
20 25 30

Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala Asp
35 40 45 

Ser Pro Thr Leu 

50 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Tyr Arg Leu Ser Val Gln Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe Glu
1 5 10 15

Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu Tyr
20 25 30




Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala Asp
35 40 45 

Ser Pro Thr Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Ala Arg Glu Tyr Ser Arg Phe Glu
1 5 10 15

Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu Tyr
20 25 30

Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala Asp
35 40 45 

Ser Pro Thr Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ala Arg Phe Glu
1 5 10 15

Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu Tyr
20 25 30

Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala Asp
35 40 45 

Ser Pro Thr Leu 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GTCATAGATT TCCACCGCGA GCCGGTATCC GAG 33

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CCGGCCGTCA TAGATTTGCA CCGAGAGCCG GTATC 35

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GCGACTGTAT TCCCGCGCGT CATAGATTTC CAC 33

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
CTCCTTCTCA AAGCGCGCGT ATTCCCGGCG GTC 33

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
ATATCTCGAG AGTATACCCC CATGGCACCT 30

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

ATATCTCGAG TCAGGGCACC ACAACGCG 28

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
ATATCTCGAG TCAGCTGCTC TTCTGGCCCT AC 32

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ATATCATATG TACACCCCCA TGGCTCCT 28

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:
ATAGGATCCT CAGCCCCACA AACAGGC 27



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GGTGGCCTTG GTCAGAGAGT CTACAAACAC 30

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGCGCTATAG CAGGTCTCTG TAGACGACCT 30
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala
35 40 45 

Asp 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:
CTCAAAGCGA CTGTACTACC GGCGGTCATA GATTTC 36
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:
CTTTCAAGAG GCAGACTGAC CCACTCTCTG AGGA 34 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:
TCCTCAGAGA GTGGGTCAGT CTGCCTCTTG AAAG 34 
(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
CATCAATCCT CGCTTTTGAG AGGCAGACAG TCCC 34 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

GGGACTGTCT GCCTCTCAAA AGCGAGGATT GATC 34 

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Tyr Arg Leu Ser Phe Glu Ile Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala
35 40 45 

Asp Ser Pro Thr Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Tyr Arg Leu Ser Val Glu Phe Tyr Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala
35 40 45 

Asp Ser Pro Thr Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 
(B) TYPE: amino acid
(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

Tyr Arg Leu Ser Val Glu Ile Phe Asp Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala
35 40 45 

Asp Ser Pro Thr Leu 

50 

(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Tyr Arg Leu Ser Val Glu Ile Tyr Ala Arg Arg Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala
35 40 45 

Asp Ser Pro Thr Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 53 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

Tyr Arg Leu Ser Val Glu Ile Tyr Asp Arg Ala Glu Tyr Ser Arg Phe
1 5 10 15

Glu Lys Glu Gln Gln Gln Leu Asn Trp Lys Gln Asp Ser Asn Pro Leu
20 25 30

Tyr Lys Ser Ala Ile Thr Thr Thr Ile Asn Pro Arg Phe Gln Glu Ala
35 40 45 

Asp Ser Pro Thr Leu 

50 
(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:
GCGGTCATAG ATTTCAAACG AGAGCCGGTA TCC 33

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:
TTCCCGGCGG TCATAGAATT CCACCGAGAG CCG 33

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
GTATTCCCGG CGGTCAAAGA TTTCCACCGA GAG 33

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
ACTGTATTCC CGGCGCGCAT AGATTTCCAC CGA 33

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:
AAAGCGACTG TATTCCGCGC GGTCATAGAT TTC 33 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CCCGAATTCA CAGGCCCCCG TCTCGTC 27

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CCCGAATTCC TCGAGTCAGG GCACCACAAC GCGGTAG 37

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CCCCCTCGAG GCTACTGCAT CCGCTTTGTT C 31
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:
CCCCTCGAGT CAGTAAGCAG ACACCAAGCC 30

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 
CCCCTCGAGC CAGCCTCTTT TGCAGTC 27

(2) INFORMATION FOR SEQ ID NO:62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CCCCTCGAGC CAGCCGAATT CAGTATC 27 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CCCCTCGAGT CACGCCCCCT TGGCCCCCTT C 31 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CCCCCTCGAG GCGGCACGGG ACTCGAAGGG 30 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CCCCTCGAGT TAAGGCACTG TGACATG 27

WHAT IS CLAIMED IS:

1. A purified and isolated FLP-1 polynucleotide 
encoding the human FLP-1 amino acid sequence set out in SEQ ID NO: 2. 

2. The polynucleotide of claim 1 which is a DNA molecule.

3. The DNA of claim 2 which is selected from the
group consisting of cDNA, genomic DNA, partially synthesized DNA,
and wholly synthesized DNA.

4. A DNA molecule comprising the human FLP-1
polypeptide encoding sequence set out in SEQ ID NO: 1.

5. A DNA molecule encoding a FLP-1 polypeptide 
selected from the group consisting of: 

a) the human DNA sequence set out in SEQ ID NO: 1;

b) a DNA molecule which hybridizes under stringent 
conditions to the noncoding strand of the protein coding portion of (a); and 

c) a DNA molecule that would hybridize to the DNA of 
(a) but for the degeneracy of the genetic code. 

6. A DNA expression construct comprising the DNA 
molecule of claim 2, 4, or 5. 

7. A host cell transformed or transfected with the 
expression construct of claim 6.

8. A method for producing a FLP-1 polypeptide
comprising growing the host cell of claim 7 in a suitable medium and 
isolating a FLP-1 polypeptide from the host cell or the medium of its 
growth. 

9. A purified and isolated FLP-1 polypeptide having the 
amino acid sequence set out in SEQ ID NO: 2. 



10. An antibody which specifically binds to FLP-1.



11. The antibody of claim 10 which is a monoclonal antibody.

12. The antibody of claim 10 which is a polyclonal antibody.

13. The antibody of claim 10 which is a recombinant antibody.



14. An anti-idiotype antibody which specifically binds to
the monoclonal antibody of claim 11.



15. A hybridoma cell line producing the monoclonal
antibody of claim 11.



16. A cell line producing the recombinant antibody of claim 13.

17. A method for identifying a compound that modulates
binding between FLP-1 and β7 integrin comprising the steps of:

a) contacting FLP-1 or a fragment thereof, with β7
integrin or a fragment thereof;

b) measuring binding between FLP-1 or a fragment
thereof, and β7 integrin or a fragment thereof;

c) measuring binding between FLP-1 or a fragment
thereof, and β7 integrin or a fragment thereof in the
presence of a test compound, and

d) comparing the measurement in step (b) and the
measurement in step (c) wherein a decrease in binding
in step (c) indicates the test compound is an inhibitor
of binding, and an increase in binding in step (c)
indicates the test compound is an activator of binding.
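
The comparison in step (d) can be illustrated with a minimal sketch. It is not part of the claim: the function name, the numeric binding signals, and the `tolerance` noise margin are all hypothetical, and the claim itself specifies only the direction of the change.

```python
def classify_test_compound(binding_without, binding_with, tolerance=0.0):
    """Compare the step (b) and step (c) binding measurements.

    binding_without: binding signal measured without the test compound (step b).
    binding_with:    binding signal measured with the test compound (step c).
    tolerance:       hypothetical noise margin (an assumption, not in the claim).
    """
    if tolerance < 0:
        raise ValueError("tolerance must be non-negative")
    difference = binding_with - binding_without
    if difference < -tolerance:
        return "inhibitor"           # binding decreased in step (c)
    if difference > tolerance:
        return "activator"           # binding increased in step (c)
    return "no effect detected"      # change lies within the assumed margin
```

With illustrative values, a signal that falls from 100.0 to 40.0 would be classified as an inhibitor and one that rises to 160.0 as an activator.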

18. A method for isolating a polynucleotide encoding a 
protein that binds to FLP-1 comprising the steps of: 

a) transforming or transfecting appropriate host cells 
with a DNA construct comprising a reporter gene 
under the control of a promoter regulated by a 
transcription factor having a DNA-binding domain 
and an activating domain; 

b) expressing in said host cells a first hybrid DNA
sequence encoding a first fusion of part or all of FLP-1
and either the DNA binding domain or the
activating domain of said transcription factor;

c) expressing in said host cells a library of second 
hybrid DNA sequences encoding second fusions of 
part or all of putative FLP-1 binding proteins and the 
DNA binding domain or activating domain of said 
transcription factor which is not incorporated in said 
first fusion;

d) detecting binding of an FLP-1 binding protein to
FLP-1 in a particular host cell by detecting the
production of reporter gene product in said host cell; 
and 

e) isolating second hybrid DNA sequences encoding 
FLP-1 binding protein from said particular host cell. 
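
The selection logic of steps (a) through (e) can be sketched as follows. This is only an illustrative model under stated assumptions: the `Fusion` class, the `interacts` callback, and the candidate names are hypothetical, and the sketch assumes the reporter is produced only when the two fusions carry complementary transcription factor domains and the fused proteins bind one another.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fusion:
    protein: str  # protein or fragment fused to the domain (hypothetical label)
    domain: str   # "DNA-binding" or "activating"


def reporter_expressed(first, second, interacts):
    """Model of step (d): the reporter gene product appears only when the
    fusions carry complementary domains and their protein parts bind."""
    complementary = {first.domain, second.domain} == {"DNA-binding", "activating"}
    return complementary and interacts(first.protein, second.protein)


def screen_library(bait, library, interacts):
    """Model of steps (c) to (e): return the library proteins whose fusions
    switch on the reporter when co-expressed with the bait fusion."""
    return [prey.protein for prey in library
            if reporter_expressed(bait, prey, interacts)]
```

In this model a library fusion carrying the same domain as the bait never scores, which mirrors the requirement in step (c) that the second fusion use the domain not incorporated in the first fusion.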






INTERNATIONAL SEARCH REPORT 



International Application No

PCT/US 97/00100



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/12 C12N5/10 C07K14/47 C07K16/18 C12Q1/68
G01N33/59

According to International Patent Classification (IPC) or to both national classification and IPC



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols)

IPC 6 C12N C07K G01N 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of document, with indication, where appropriate, of the relevant passages



Relevant to claim No. 



THE JOURNAL OF CELL BIOLOGY,
vol. 111, September 1990,
pages 1089-1105, XP000673362
GORLIN, J.B., ET AL.: "HUMAN
ENDOTHELIAL ACTIN-BINDING PROTEIN
(ABP-280, NONMUSCLE FILAMIN): A MOLECULAR
LEAF SPRING"
see the whole document
Relevant to claim No.: 5-8, 10-15

WO 92 13000 A (AMRAD CORP LTD) 6 August 1992
see the whole document
Relevant to claim No.: 5-8



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



* Special categories of cited documents:

"A" document defining the general state of the art which is not
considered to be of particular relevance

"E" earlier document but published on or after the international
filing date

"L" document which may throw doubts on priority claim(s) or
which is cited to establish the publication date of another
citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or
other means

"P" document published prior to the international filing date but
later than the priority date claimed



"T" later document published after the international filing date
or priority date and not in conflict with the application but
cited to understand the principle or theory underlying the
invention

"X" document of particular relevance; the claimed invention
cannot be considered novel or cannot be considered to
involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention
cannot be considered to involve an inventive step when the
document is combined with one or more other such documents,
such combination being obvious to a person skilled
in the art.

"&" document member of the same patent family



Date of the actual completion of the international search 



7 May 1997 



Date of mailing of the international search report 



.1 05. 97 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016



Authorized officer 



Holtorf, S 



Form PCT/ISA/210 (second sheet) (July 1992)



page 1 of 2 






INTERNATIONAL SEARCH REPORT

International Application No

PCT/US 97/00100



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT


Category * 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 



CELL,
vol. 76, 28 January 1994,
pages 301-314, XP002030607
SPRINGER, T.A.: "TRAFFIC SIGNALS FOR
LYMPHOCYTE RECIRCULATION AND LEUKOCYTE
EMIGRATION: THE MULTISTEP PARADIGM"
cited in the application
see the whole document
Relevant to claim No.: 1-18



2 



Form PCT/ISA/210 (continuation of second sheet) (July 1992)



page 2 of 2 



INTERNATIONAL SEARCH REPORT

Information on patent family members

International Application No

PCT/US 97/00100



Patent document            Publication    Patent family    Publication
cited in search report     date           member(s)        date

WO 9213000 A               06-08-92       NONE



Form PCT/ISA/210 (patent family annex) (July 1992)



